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New integrated circuits are usually accom- 
panied by a wealth of theory and data 
sheets. Shortly thereafter follow the applica- 
cation notes. The introduction of micropro- 
grammable LSI parts, such as the Am2901 
and subsequent ICs in the family, adhered to 
this pattern. We thought this was adequate 
in light of the previously successful intro- 
duction of fixed-instruction-set MOS micro- 
processors, which were more complex. 

However, bit-slice microprocessor design 
proved more formidable than first realized. 
One reason was the intimate relationship 
between parts. These designs required the 
designer to pick and choose parts: How 
many slices are needed to do the job? 
Which microprogram sequencer and/or con- 
troller to select? Is a carry lookahead gen- 
erator needed? And on, and on and. ... All 
these devices had to play together; no single 
device was complete by itself. 

For this added up-front design effort, the 
user got blazing speed and the utmost 
flexibility. The latter proved the second hin- 
derance to easy designing. Users now had 
to design the instruction set as well as the 
hardware and applications programs. They 
no longer had the luxury of a fixed-instruction 
set. On the other hand, they could eliminate 
unnecessary instructions, easily modify or 
add instructions at a later date or emulate 
the existing instruction set of a slower CPU. 

Complicating matters was the fact that the 
2900 family did not spring whole into the 
world. Parts were introduced and redesigned 
over a period of years as engineering and 
processing resources could be brought to 
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bear. This evolutionary process still goes on. 
To alleviate matters, Advanced Micro 


- Devices announced a nine-part course in 


microprogrammable microprocessing, each 
part to standalone but tobuild logically upon 
the preceding part. And, because engineer- 
ing talent is our most important resource, 
this course was to unfold over a 22-month 
period. 

Since completion of the course, there has 
been no diminishing in demand for informa- 
tion on the material covered. In fact, the 
market for bipolar microprogrammable LSI 
parts doubled in each of the previous two 
years and showed no signs of slowing. So, 
as our copies of individual course materials 
dwindled, we thought it only natural to bring 
them all together under one cover. This book 
is the result. 

We think the extraordinary time and ef- 
fort was well worth it. 
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Computer Architecture 


PREFACE 


In this introductory Chapter we intend to: 

1). develop a common terminology for future chapters. 

2). introduce several stored-program-computer design topics. 

3). define some of the computer architect's problems (which 
will be solved in the subsequent chapters). 


In order to achieve these goals, we will start with computer 
basics. It should be stressed that approaches and solutions 
can be chosen which are different from the ones described in 
this and the subsequent chapters. However, the general ideas 
described will be appropriate to gain familiarity with the micro- 
programmable bit-slice devices in order to use them in any 
design configuration. 


BACK TO THE BASICS. .. 


A STORED-PROGRAM-COMPUTER is defined as a machine 
capable of manipulating data according to predefined rules 
(instructions), where the program (collection of instructions) 
and data are stored in its memory (Fig. 1). Without some 
means of communication with the external world, the program 
and the data cannot be loaded into the memory nor can the 
results be read out. Therefore, an input/output device is re- 
quired as shown in Fig. 2. 
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Figure 1. Basic Definition of a Stored-Program-Computer. 
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Figure 2. I/O Added to the Basic Stored-Program 
Computer. 








The memory is usually organized in words, each containing N 
bits of information. A unique address is allocated for each 
word which defines its position relative to other words. The 
Central Processor Unit (CPU) usually reads or writes one 
word at a time by addressing the memory and then when the 
memory is ready, reading the contents of the word or writing 
new contents into that word. To perform this operation, two 
registers are usually used: The Memory Address Register 
(MAR), which contains the address and the Memory Data 
Register (MDR) which contains the data (Fig. 3). 
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Figure 3. MAR and MDR Depicted for a 
Stored-Program Computer. 


Since accessing a memory (reading from it or writing into it) is 
usually a relatively slow procedure, it is advantageous to have 
a few memory locations inside the CPU which can be read 
from or written into very fast. These locations are usually 
called Accumulators or Working Registers. Having these fast 
access registers inside the CPU (Fig. 4) enables many opera- 
tions to be carried out without referring to the memory 
(through the MAR and the MDR) and therefore these opera- 
tions are executed faster. 


The unit which actually performs the data manipulation is 
called the Arithmetic & Logic Unit (ALU). It has two inputs for 
operands and one output for the result. It usually operates on 
all the bits of a word in parallel. The ALU can perform all or 
part of the following operations: 


Arithmetic Logical 
Add OR 
Complement AND 
Subtract XOR 
Increment NAND 
Decrement NOR 
XNOR 
Complement 


In some architectures, one of the operands must always be in 
a special register (accumulator) and the result of the ALU op- 
eration is always transferred to this register. In a more general 
CPU, any two of the internal registers can contain the 
operands and the result of the ALU operation can be trans- 
ferred to any one of them. 


Another very useful feature of a CPU is the ability to shift the 
contents of a register or the output of the ALU one or more 
bits in either direction as shown in Fig. 5. 
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Figure 4. CPU with Internal High Speed Registers. 
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Figure 5. ALU and Shifter Added to the CPU Design. 


We now have the elements to do any data manipulation re- 
quired but we still need a unit which can properly set the MAR 
in order to find the next instruction of the program in the 
memory and to find its associated data. This unit is called the 
Program Control Unit (PCU) and its role is to load the MAR 
with the correct address in order to find the next instruction or 
data item or to point to a memory location where a data word 
should be written. 


Often, the program steps (instructions, data) are written in the 
memory in consecutive locations, starting at address zero or 
at any other predefined address. The PCU can simply be in- 
cremented after each memory access thereby pointing to the 
address of the next instruction or data item. This counter-type 
PCU has very little flexibility. Sometimes we wish to change 
the “normal” flow of the instructions, particularly if we want to 
enable our computer to “make decisions” according to condi- 
tions prevailing at the current execution point. For example, 
we may want to execute one of two different sequences of in- 
structions depending upon the result of the last operation per- 
formed. This is accomplished by loading the MAR with a new 
value (the address of the next instruction to be executed) 
rather than incrementing it. This operation is called a 
BRANCH or JUMP and can be unconditional (which allows 
execution of a non-contiguous string of instructions) or condi- 
tional (depending, for example, on whether the last opera- 
tion’s result was zero or not, was negative or positive, true or 
false, etc.). 





Even more flexibility can be achieved by using a stack (a 
group of temporary internal or external memory locations) to 
store vital data. A stack pointer is used to address the mem- 
ory location currently at the top of the stack. Indirect and rela- 
tive addressing and other sophisticated addressing modes (all 
of which can be handled by the PCU) will be discussed later. 
Meanwhile, Fig. 5 shows the PCU as a part of the CPU. 


Executing an instruction in our computer now requires the 
following steps: 


a). The PCU loads the address of the next instruction to the 
MAR and signals to the memory that a Read is re- 
quested. Incidentally, the PCU may be as simple as a 
Program Counter equal to the address width. The mem- 
ory loads the MDR with the contents of the location ad- 
dressed. 

b). The CPU decodes the instruction: i.e., (assuming 
operands are in internal registers) selects the proper reg- 
isters to feed the ALU, selects the proper function to be 
performed by the ALU, sets up the shifter to displace the 
result, if required, and selects the register in which the 
result should be stored. 

c). The ALU performs the function desired. 

d). The result is loaded into the destination register. 

e). The result is also examined to determine whether a 
BRANCH is to be performed. 

f). The PCU calculates the address of the next instruction, 
(usually called a “FETCH”). 


This procedure becomes more complicated if the operands 
are not stored in the internal registers or if the result is not to 
be stored in one of them. Let’s take an example instruction 
using relative addressing: 


“Take the first operand from the location specified by the 
sum of the word after this instruction (immediate) and the 
contents of register R1; take the second operand from 
the location specified by the sum of the second word 
after this instruction and the contents of R2; add the two 
operands and place the result in the location specified by 
the sum of the third word after this instruction and the 
contents of register R3. Then execute the instruction lo- 
cated at the address, which is the sum of the fourth word 
after this instruction and the contents of register R4 if 
there is a carry resulting from the addition. Otherwise 
continue sequentially”. 


The steps required to execute this instruction are as follows: 


a). The PCU loads the address of the next instruction to the 
MAR, signalling to the memory that a Read is requested. 
The memory loads the MDR with the contents of the loca- 
tion addressed. 

b). The CPU decodes the instruction, i.e., initiates the follow- 
ing steps. 

c). The PCU is incremented and the next word is read from 
the memory. 

d). Register R1 and the MDR are selected as source regis- 
ters, MAR is the destination register. 

e). The ALU performs “ADD” and the result is placed in the 
MAR. 

f). The first operand is fetched from the memory and placed, 
for example, in R5. 

g). The PCU is incremented and the next word is. read from 
the memory. 

h). Register R2 and the MDR are selected again as source 
registers and MAR as the destination. 


1). The ALU performs “ADD” and the result is placed in 
MAR. 

j). The second operand is fetched from the memory and is 
placed, for example, in R6. 

k). The PCU its incremented, the next word is read from the 
memory. 

I). Register R3 and the MDR are selected as source regis- 
ters, the MAR as destination. 

m). The ALU performs “ADD” and the result is placed in the 
MAR, which now points to the location where the sum of 
the operands should be stored. 

n). Registers R5 and R6 are selected as sources (they con- 
tain the operands), MDR is now the destination. 

o.) The ALU performs “ADD” and the result is placed in 
MDR. 

p). A memory write cycle takes place and the contents of the 
MDR is stored at the desired address. 

q). The carry is examined to determine the next step to be 
performed. Assume there is no carry. 

r). The PCU is incremented twice (in order to skip the fifth 
word of the present instruction). It now points to the ad- 
dress of the next instruction. 


As can be seen, 18 steps were used to perform a single ad- 
dition using this complex relative addressing scheme. Obvi- 
ously, our CPU needs some kind of “coordinator” which can: 


1). Decode an instruction fetched from the memory. 

2). Initiate the proper cycle of steps to be performed. 

3). Set up the various controls for each step. 

4). Execute the steps in an orderly sequence. 

5). Make decisions according to the state of various signals 
(conditions). 


We will call this coordinator the Computer Control Unit (CCU) 
and it is depicted in Fig. 6. Our CPU is now complete (more 
or less) and we will go into more detail later. 





THE MEMORY 


Let’s now discuss the memory. The information stored in the 
memory is organized in words, where each word consists of N 
bits. N may be as small as 8 for very simple processors or as 
large as 64 in more powerful machines. The most common 
memory width for minicomputers is 16 bits. The number N is 
called the width of the memory and the number of bits in the 
MDR Is obviously also N; equal to the width of the memory. 


The depth of a memory is the number of words it contains. 
With a MAR having k bits, 2* consecutive memory locations 
can be addressed. The addresses start from zero and range 
through 2-1. 


The read access time of a memory directly accessible by the 
CPU is the time needed from stable address at the memory 
until the data is properly stored in the MDR. This access time 
depends on the type of memory used and can be as low as a 
few tens of nanoseconds and as large as several micro- 
seconds. Using high speed memory improves the perfor- 
mance of the computer as less time is wasted waiting for the 
memory to respond. In general, faster memories are costly, 
take more PC board area and use more power which results 
in more heat. A 32 bit wide, 2K (2048) word memory with 50 
nanosecond access time may need 10 amps from the +5V 
power supply and may require a board area of 10” x 6”. Yet 
this is a very small memory space. 


It is usually not justified to have very large high-speed 
memories. Not all the programs and associated data need to 
reside in this memory at once. We may have the current pro- 
gram (or only a part of it) in the memory while other programs 
or data files can reside elsewhere and be brought into mem- 
ory during the appropriate part of the program when needed. 
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Figure 6. A Computer Control Unit (CCU) Included in a CPU. 


This “elsewhere” may be a magnetic tape, cassette, disk, 
diskette, etc. and we will call it Bulk Memory. The distinctive 
characteristics of Bulk Memory are: 

1). very large capacity 

2). non-volatile (retains the information when not in use) 

3). not randomly accessible 

4). long access time 

5). inexpensive (per bit) 


Usually, Bulk Memory devices are serially accessible, i.e., the 
access time for the first word is large, but then consecutive 
words can be accessed relatively fast. 


In a later chapter the most efficient process of communication 
between the main and the bulk memory, called the Direct 
Memory Access (DMA), will be discussed in detail. 


THE EXTERNAL WORLD 


In any useful machine, some means of communicating with 
the external word is needed. It may be a keyboard, a CRT, a 
card reader, a paper tape punch or, in a process controller, 
reading sensors or positioning actuators. The common de- 
nominator of almost all of the input/output devices is that they 
are much slower than the CPU and therefore a timing prob- 
lem arises; the CPU must know when the I/O device is ready 
for data transfer. Usually, a signal is sent by the device to the 
CPU in order to draw its attention. The CPU now can do one 
of two things: 


1). Test this signal periodically and when it is present, jump 
to a program which handles the data transfer. This type 
of operation is called “Polling”. Tnis technique has two 


major drawbacks: First, appreciable computer time is 
spent performing these periodic tests where most of them 
will fail (no “Ready” signal present). Second, the recogni- 
tion by the computer CPU of the appearance of a signal 
is delayed until the CPU arrives at this device in its pol- 
ling sequence. 


Imagine what will happen if there are a large number of 
I/O devices. Long latency times (delays) will occur if 
many I/O devices are busy simultaneously. 


2). Include some hardware in the CPU which can sense the 
presence of a “Ready” signal and interrupt the normal 
flow of the instructions and force the computer to “Jump” 
to the I/O service program whenever there is a request. It 
can even send the CPU to different programs according 
to the I/O device whose “Ready” flag was detected and 
even establish priority among the different devices if more 
than one device would like to have the CPU’s attention at 
the same time. Moreover, under program control, this cir- 
cuitry can ignore some or all of the signals if the com- 
puter CPU must not be interrupted at that time. Obviously 
by paying the price of very little hardware, we gain 
enormously in computer performance. We will call this 
hardware the “Interrupt Controller’ and will discuss it 
thoroughly later. 


Our computer is now depicted in Fig. 7. We have included the 
ALU, the internal register file and the shift circuit in one block, 
which we call the “Arithmetic Processor Unit.” 


In the following pages and in the subsequent chapters, we will 
deal in more detail with each area of the machine. 
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Figure 7. The Stored-Program-Computer with DMA and Interrupt Control Added. 


A WORD ABOUT THE INSTRUCTION SET | 


The internal architecture of the CPU depends to some extent 
on the instruction set the computer is to execute. If the in- 
struction set is large, some of the instructions usually are 
more complicated and the computer is more powerful, faster 
and more efficient. On the other hand, the internal circuitry is 
also more complicated. Some examples of these tradeoffs are 
as follows. 


ALU Processing Capability: 


Although with three basic functions (add, complement, and 
OR/AND) all the arithmetic and logic operations can be per- 
formed, most processors are built to perform subtract, NAND, 
XOR, etc. This is perhaps the most outstanding example of 
how performance and speed can be gained with little penalty 
on the complexity of the machine. With the added features an 
XOR operation can be performed in one instruction instead 
of 5. 


Data Movement: 


Let us assume 4 different computers whose data movement 
Capabilities are described below: 


Machine A). A word can be read from the memory and 
loaded into Register A only. The contents of Register A can 
be written into the memory, or can be moved into any other 
register. The contents of any register can be copied into 
Register A. 


Machine B). The contents of any register can be copied into 
any other register or it can be written into the memory. A word 
read from the memory can be loaded into any register. 


Machine C). As B above but with the added capability to read 
from one location in memory, to write that word into another 
location in memory. 


Machine D). As C above and also the memory-to-memory 
operation can be performed on consecutive addresses repeti- 
tively. The number of word transfers (or upper and lower ad- 
dress limits) are specified by the instruction. 


Machine A has very limited data movement capability. In 
order to perform an operation on two operands residing in the 
memory, we have to: 


1). Bring the first operand from the memory into Register A. 
2). Copy it into another register. 

3). Bring the second operand into Register A. 

4). Perform the operation required (result in A). 

5). Store the contents of Register A into the memory. 


If consecutive operations are required with several partial re- 
sults, the drawbacks of machine A become more annoying, 
especially if the number of internal registers is small. 


Moving a data block from one location in the memory to 
another location can be performed by one instruction in com- 
puter D, but requires the transfer of each word first to an in- 
ternal register then to the new memory location in machines 
A, B (two instructions for each word transferred). 


Obviously the decoding, multiplexing and sequencing of the 
computers grow in complexity as we proceed from machine A 
to machine D. We trade the complexity of hardware versus 
the software (programming), speed and performance. 


Addressing: 
The operands for an operation can be found in several ways: 


@ The operand is an explicit part of the instruction (Im- 
mediate) 

@ The address of the operand is an explicit part of the in- 
struction. (Direct) 

@ The address of the operand is in an internal register; the 
register itself is specified by the instruction. (RR) 

e@ The address of the operand is the sum of the contents of 
an internal register (specified by the instruction) and a 
number (called the displacement) which is an explicit part 
of the instruction. (RX) 

@ The contents of an internal register are added to a 
number found in an address specified by the instruction. 
The sum is the address of the operand. (Indirect) 

e The contents of an internal register are added to a 
number which is an explicit part of the instruction. The 
sum points to the location where the address of the 
operand is written. (Indirect) 

e The contents of an internal register are added to a 
number which can be found at the location explicitly 
specified by the instruction. The sum thus formed points to 
a location where the address of the operand is written. 

e Etc. 


Many other schemes can be formed by combining the above 
operations or by chaining them. In every case an “Effective 
Address” must be found by calculations and/or memory refer- 
ences. Again, we can gain performance by using more 
sophisticated addressing schemes but we will pay for it by 
adding complexity to our machine, especially in its control por- 
tion. 


TIMING, SEQUENCING, CONTROLLING 


In the previous paragraphs we have shown that we can gain 
performance in our computer by having a more complicated 
instruction set but more complex hardware is required, usually 
in the CCU. We have also shown an example for an “Add” 
Operation which required 18 precisely controlled steps. Even if 
we assume that some of them can be performed simultane- 
ously, we will need a multiphase clock to control these steps 
— something like that shown in Fig. 8. We can now load an 
instruction register at the beginning of an instruction with the 
first word of the instruction (the OP CODE) as is shown in 
Fig. 9. Using the outputs of the Instruction Register (IRg to 
IRp-1), the different phases of the clock and the various condi- 
tion inputs to the CCU, we can now try to write the logical 
equations which should satisfy all of the steps of all the in- 
structions of our instruction set. Then use Karnough maps or 
other techniques to reduce these equations and finally realize 
them using AND, OR, INVERT gates and Flip Flops. Simple, 
isn’t it? Imagine the complexity of a sophisticated computer 
and the debugging process it needs! 


The question posed immediately is: Isn’t there a more or- 
ganized and more easily understandable way to do that? Or, 
perhaps, can we have some processor do the job for us? 
Can’t we have some kind of “micro-machine” which can take 
care of all the timing, sequencing and controlling jobs of our 
computer — a computer inside the computer? With the advent 
of the Am2900 family — new Bipolar LSI devices — the an- 
swer is: Yes, we can! 
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Figure 8. An 8-Phase Clock. 
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Figure 9. The Instruction Register Bits. 
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Figure 10. The Micromachine. 


THE MICRO-MACHINE 


What we need is essentially a machine which can execute a 
number of well defined sequences. But, remember that this is 
exactly the purpose of a stored program computer. The only 
difference between our micro-machine and a general purpose 
computer is that in the general purpose computer the program 
to be executed is changed from task to task, while in our 
micro-machine it is fixed. This allows the use of PROM for its 
memory instead of the RAM needed in the general purpose 
(GP) computer. Our Computer Control Unit (CCU) using this 
micro-machine may now look like Figure 10. 


Basically, a microprogrammed machine is one in which a 
coherent sequence of microinstructions is used to execute 
various commands required by the machine. If the machine is 
a computer, each sequence of microinstructions can be made 
to execute a machine instruction. All of the little elemental 
tasks performed by the machine in executing the machine in- 
struction are called microinstructions. The storage area for 
these microinstructions is usually called the microprogram 
memory. 


A microinstruction usually has two primary parts. These are: 
(1) the definition and control of all elemental micro-operations 
to be carried out and (2) the definition and control of the ad- 
dress of the next microinstruction to be executed. 


The definition of the various micro-operations to be carried out 
usually includes such things as ALU source operand selec- 
tion, ALU function, ALU destination, carry control, shift control, 
interrupt control, data-in and data-out control, and so forth. 
The definition of the next microinstruction function usually in- 
cludes identifying the source selection of the next micro- 
instruction address and, in some cases, supplying the actual 
value of that microinstruction address. 


Microprogrammed machines are usually distinguished from 
non-microprogrammed machines in the following manner. 
Older, non-microprogrammed machines implemented the con- 
trol function by using combinations of gates and flip-flops 
connected in a somewhat random fashion in order to generate 
the required timing and control signals for the machine. Mi- 
croprogrammed machines, on the other hand, are normally 


considered highly ordered and more organized with regard to 
the control function field. In its simplest definition, a micro- 
program control unit consists of the microprogram memory 
and the structure required to determine the address of the 
next microinstruction. 


The OP-CODE (type of instruction to be executed by the 
computer) is loaded into the Instruction Register and the In- 
struction Decoder decodes it. Actually, it generates the micro- 
address where the first step of the execution sequence for 
that instruction resides in the microprogram memory. The 
Am2910 sequencer then generates the microaddress of the 
next microinstruction. The microprogram data supplies the 
control signals we need to control all the parts of the com- 
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Figure 11. Computer Control Function Flow Diagram. 
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puter (and there are a lot of them), including the sequencer 
itself. When all the steps of a machine instruction are exe- 
cuted, the microprogram will cause the reading (fetch) of the 
next machine instruction from the computer main memory. 
Typically, the Computer Contro/ Unit is used to fetch instruc- 
tions and decode them using a PROM for mapping the op 
code to the initial address of the sequence of microinstruc- 
tions used to execute this particular instruction. It will also 
fetch all of the operands needed by the machine instruction 
and deliver them to the ALU for processing. An example of 
the flow of a typical Computer Control Unit is shown in Figure 
11. 


Assume the OP-CODE of the machine instruction that we 
fetch is 8 bits wide. This allows us to execute a minimum of 
256 different instructions. Assume also that an average of 6 
steps are needed to execute these instructions. Even if sepa- 
rate microprogram memory locations are used, a depth of this 
microprogram memory is only 1-1/2K (K = 1024). But in that 
case, the sequencer can almost be replaced by a simple 
counter. Usually we would like to share some micro-routines 
among different instructions. With very little effort, we can 
shrink the depth of the microprogram memory of Figure 10 to 
less than 1/2K. Of course the sequencer will be a little more 
sophisticated; it will perform conditional Branch and micro- 
subroutine CALL’s; but we still don’t need the complicated 
addressing schemes for microprogram control as were de- 
scribed earlier as a part of the machine instruction set. 


On the other hand, the width of our microprogram memory 
may be large — maybe 60 to 100 bits. This will depend on the 
number of control lines needed in our computer. This is of no 
great disadvantage since the price of PROM devices is drop- 
ping constantly. In a future chapter we will discuss techniques 
to reduce the depth and width of the microprogram memory to 
save cost. 


It is important to understand the distinction between machine 
level instructions and microprogram instructions. Figure 12 
shows a typical machine instruction for a 16 bit minicomputer 
that has an 8-bit opcode to identify one of 256 instructions; a 
4-bit source register specification to identify one of 16 source 
registers and a 4-bit destination register specification to iden- 
tify one of 16 destination registers. The microprogram instruc- 
tion of Figure 12 may contain from 32 to 128 bits in a typical 
design; or even more bits in a very fast, highly parallel mi- 
crocoded machine. This microinstruction word usually will con- 
tain fields for the ALU source operand, ALU function, ALU des- 
tination, status load enable, shift multiplexer control, bus 
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The machine instruction is 16-bits and consists of an op code, source register and destination register specifica- 
tion. The microprogram instruction defines all the elemental signals to control the various pieces of the machine. 





Figure 12. 
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cycle control, etc. These fields are used to control the various 
devices within the machine so that its execution is as desired 
on each clock cycle. This is more straightforward than using 
~ combinatorial logic and yields a more organized design. 


Let us now compare the depth-over-width (d/w) ratio of the 
computer's main memory to that of our microprogram mem- 
Ory. 


In the Am9080A type microprocessor, the data field is 8 bits 
and the address field is 16 bits, allowing direct addressing of 
64K locations. The ratio d/w is 8K. In some minicomputers, 
the data width is 16-32 bits and the addressing capability is 
64-128K. The d/w ratio is about the same. In larger computers 
with 32-64 bit data width, we find 256-512K deep memories or 
even deeper ones. The d/w ratio again is 8K at least. 


On the other hand, the d/w ratio in microprogram memories is 
seldom greater than a few tens. Even if we assume that it is 
2K deep and only 64 bits wide, we arrive at a d/w ratio of only 
32; usually it will be around 10. It is much easier to control a 
machine with a d/w ratio of 10 to 20 than to control one with 
d/w = 8K. 


ONE MORE WORD 


We have suggested a replacement of the “random logic” 
realization of the CCU by a micro-machine. We call this a 
“Microprogrammed Architecture”. Perhaps the biggest advan- 
tage of this type of architecture is the ease of structuring the 
control sequence. We allocate a bit or a group of bits in the 
microprogram memory to control a certain function (e.g.: ALU 
source register selection, ALU function, ALU destination selec- 
tion, condition selection, next address calculation selection, 
MDR destination selection, MAR source selection, etc., etc.) 
and for each microstep we write the appropriate state for 
these bits (LOW-HIGH) into this memory field. Later we will 
see that automated and sophisticated tools are avail- 
able to perform this microprogram writing. One such tool is 
AMDASM™ as available on System 29. But, this is not the only 
advantage of the microprogrammed architecture. 
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As nobody is perfect, some “bugs” may inadvertently slip into 
the design. In a random logic architecture, we will have to re- 
design and usually rebuild the whole computer. On the other 
hand, in a microprogrammed machine it is usually sufficient to 
change a couple of bits in the microprogram to rectify the 
problem. This is even easier if a RAM instead of a PROM is 
used during the development and debugging phases. Of 
course, we must be able to load this memory with the micro- 
program by some external means. Again, a powerful tool is 
available: AMD’s System/29™. 


Finally, let's face the reality: The marketing guys usually 
change their requirements (i.e., the instruction set) when you 
are 80% through your logic design. Now you have to start 
over from scratch. Not so! Change some microcode, perhaps 
very little hardware too and here you are! It is even more 
convenient when only additions to the existing instruction set 
are considered. Just add a few lines to your microprogram to 
comply with those new ideas! A mere few minutes using Sys- . 
tem 29 — That's flexibility! Incidentally, don’t tell the marketing 
guys how easy it is or you will NEVER get the product out!! 


SUMMARY 


The block diagram of Figure 13 shows a typical 16-bit 
minicomputer architecture. Also identified on this block dia- 
gram are various Am2900 family elements that might be used 
in each of these blocks. Such a design might use either 
4-Am2901A’s or 4-Am2903’s for the data path ALU. An 
Am2910 could be used as the microprogram sequencer for 
control of up to 4K words of microprogram memory. Also 
shown on the block diagram are the Am9130 and Am9140 
MOS Static RAM’s which are potential candidates for use in 
the computer's main memory. 


The following chapters will discuss various blocks of Figure 13 
in detail and give design examples for each section. Needless 
to say, the design engineer can appropriately tailor any design 
to meet his throughput requirements. Also, special algorithms 
can be executed by adding the appropriate hardware and mi- 
crocode to the blocks described. 
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Figure 13. A Generalized Computer Architecture. 
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CHAPTER Il 
MICROPROGRAMMED DESIGN 


INTRODUCTION 


A microprogrammed machine is one in which a coherent se- 
quence of microinstructions is used to execute various com- 
mands required by the machine. If the machine is a computer, 
each sequence of microinstructions can be made to execute a 
machine instruction. All of the little elemental tasks performed 
by the machine in executing the machine instruction are called 
microinstructions. The storage area for these microinstructions 
is usually called the microprogram memory. This technique was 
identified by Wilkes in the 1950’s as a structured approach to 
the random control logic in a computer. 


A microinstruction usually has two primary parts. These 
are: (1) the definition and control of all elemental micro- 
operations to be carried out and (2) the definition and control of 
the address of the next microinstruction to be executed. 


The definition of the various micro-operations to be carried out 
usually includes such things as ALU source operand selection, 
ALU function, ALU destination, carry control, shift control, inter- 
rupt control, data-in and data-out control and so forth. The def- 
initton of the next microinstruction function usually includes 
identifying the source selection of the next microinstruction ad- 
dress, and in some cases, supplying the actual value of that 
microinstruction address. 


Microprogrammed machines are usually distinguished from 
non-microprogrammed machines in the following manner. Old- 
er, non-microprogrammed machines implemented the control 
function by using combinations of gates and flip-flops con- 
nected in a somewhat random fashion in order to generate the 
required timing and control signals for the machine. Micropro- 
grammed machines, on the other hand, are normally considered 
highly ordered and more organized with regard to the control 
function field. In its simplest definition, a microprogram control 
unit consists of the microprogram memory and the structure 
required to determine the address of the next microinstruction. 


Microprogramming its normally selected by the design engineer 
as a control technique for finite state machines because it im- 
proves flexibility, performance, and LSI utilization. Several addi- 
tional key features of microprogrammed designs are listed be- 
low: 


More structured organization 

Diagnostics can be implemented easily 

Design changes are simple 

Field updates are easy 

Adaptations are straightforward 

System definition can be expanded to include new features 
Documentation and Service are easier 

Design aids are available 

Cost and design time are reduced 


THE MICROPROGRAM MEMORY 


The microprogram memory is simply an N word by M bit mem- 
ory used to hold the various microinstructions. For an N word 
memory, the address locations are usually defined as location 
0 through N—1. For example, a 256-word microprogram mem- 
ory will have address locations 0 through 255. Each word of 
the microprogram memory consists of M bits. These M bits are 
usually broken into various field definitions and the fields can 
consist of various numbers of bits. It is the definition of the var- 
ious fields of a microprogram word that is usually referred to as 
FORMATTING. 


An example of how microinstruction fields are defined in a typ- 
ical machine microprogram memory word is as follows: 


Field 1 — General purpose 

Field 2 — Branch address 

Field 3 — Next microinstruction address control 
Field 4 — Condition code multiplexer control 
Field 5 — Interrupt control 

Field 6 — Fast clock/slow clock select 

Field 7 — Carry control 

Field 8 — ALU source operand control 

Field 9 — ALU function control 

Field 10 — ALU destination control 

Field 11 — Shift multiplexer control 

Field 12 — etc. 


EXECUTING MICROINSTRUCTONS 


Once the microprogram format has been defined, it is neces- 
sary to execute sequences of these microinstructions if the 
machine is to perform any real function. In its simplest form, all 
that is required to sequence through a series of microinstruc- 
tions is a microprogram address counter. The microprogram 
address counter simply increments by one on each clock cycle 
to select the address of the next microinstruction. For example, 
if the microprogram address counter contains address 23, the 
next clock cycle will increment the counter and it will select ad- 
dress 24. The counter will continue to increment on each clock 
cycle thereby selecting address 25, address 26, address 27, 
and so forth. If this were the only control available, the machine 
would not be very flexible and it would be able to execute only 
a fixed pattern of microinstructions. 


The technique of continuing from one microinstruction to the 
next sequential microinstruction is usually referred to as CON- 
TINUE. Thus, in microprogram control definition, we will use the 
CONTINUE (CONT) statement to mean simply incrementing to 
the next microinstruction. 


MICROPROGRAM JUMPING 


If the microprogram control unit is to have the ability to select 
other than the next microinstruction, the control unit must be 
able to load a JUMP address. The load contro! of a counter 
can be a single bit field within the microprogram word format. 
Let us call this one-bit field the microprogram address counter 
load enable bit. When this bit is at logic 0, a load will be inhib- 
tted and when this bit is a logic 1, a load will be enabled. If the 
load is enabled, the JUMP address contained within the micro- 
program memory will be parallel! loaded into the microprogram 
address counter. This results in the ability to perform an N-way 
branch. For example, if the branch address field is eight bits wide, 
a JUMP to any address in the memory space from word 0 through 
word 255 can be performed. 


This simple branching control feature allows a microprogram 
memory controller to execute sequential microinstructions or 
perform a JUMP (JMP) to any address either before or after 
the address currently contained in the microprogram address 
counter. 


CONDITIONAL JUMPING 


While the JUMP instruction has added some flexibility to the 
sequencing of microprogram instructions, the controller still 
lacks any decision-making capability. This decision-making 
Capability is provided by the CONDITIONAL JUMP (COND 
JMP) instruction. Figure 1 shows a functional block diagram of 
a microprogram memory/address controller providing the capa- 
bility to jump on either of two different conditions. In this exam- 
ple, the load select control is a two-bit field used to control a 
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Figure 1. A Two-Bit Control Field Can be Used 
to Select CONTINUE, BRANCH, or 
CONDITIONAL BRANCH. 


four-input multiplexer. When the two-bit field is equivalent to bi- 
nary zero, the multiplexer selects the zero input which forces 
the load control inactive. Thus, the CONTINUE microprogram 
control instruction is executed. When the two-bit load select 
field contains binary one, the D, input of the multiplexer is 
selected. Now, the load control is a function of the Condition 1 
input. If Condition 1 is logic 0, the microprogram address 
counter increments and if Condition 1 is logic 1, the jump ad- 
dress will be parallel loaded in the next clock cycle. This opera- 
tion is defined as a CONDITIONAL JUMP. If the load select 
input contains binary 2, the Dz input is selected and the same 
conditional function is performed with respect to the Condition 2 
input. If the load select field contains binary 3, the D3 input of 
the multiplexer is selected. Since the D3 input is tied to logic 
HIGH, this forces the microprogram address counter to the load 
mode independent of anything else. Thus, the jump address is 
loaded into the microprogram address counter on the next 
clock cycle and an UNCONDITIONAL JUMP is executed. This 
load select control function definition is shown in Table 1. 


TABLE 1. 
LOAD SELECT CONTROL FUNCTION. 


$189 | ___—Function 


Continue 


Jump Condition 1 True 
Jump Condition 2 True 
Jump Unconditional 





OVERLAPPING THE MICROPROGRAM 
INSTRUCTION FETCH 


Now that a few basic microprogram address control instructions 
have been defined, let us examine the control instructions used 
In a microprogram control unit featuring the overlap fetching of 
the next microinstruction. This technique is also known as 
“pipelining”. The block diagram for such a microprogram con- 
trol unit is shown in Figure 2. The key difference when com- 
pared with previous microprogrammed architectures is the exis- 
tence of the “pipeline register” at the output of the microprogram 
memory. By definition, the pipeline register (or microword 
register) contains the microinstruction currently being executed 
by the machine. Simultaneously, while this microinstruction is 
being executed, the address of the next microinstruction is 
applied to the microprogram memory and the contents of that 
memory word are being fetched and set-up at the inputs to the 
pipeline register. This technique of pipelining can be used to 
improve the performance of the microprogram control unit. This 
results because the contents of the microprogram memory 
word required for the next cycle are being fetched on an over- 
lapping basis with the actual execution of the current micro- 
program word. It should be realized that when the pipeline ap- 
proach is used, the design engineer must be aware of the fact 
that some registers contain the results of the previous mi- 
croinstruction executed, some registers contain the current mi- 
croinstruction being executed, and some registers contain data 
for the next microinstruction to be executed. 
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Figure 2. Overlapping (or Pipelining) the Fetch of the 
Next Microinstruction. 


Let us now compare the block diagram of Figure 2 with that 
shown in Figure 1. The major difference, of course, is the addi- 
tion of the pipeline register at the output of the microprogram 
control memory. Also, notice the addition of the address multi- 
plexer at the source of the microprogram memory address. 
This address multiplexer is used to select the microprogram 
counter register or the pipeline register as the source of the 
next address for the microprogram memory. The condition code 
multiplexer is used to control the address multiplexer in this ad- 
dfess selection. By placing an incrementer at the output of the 
address multiplexer, is is possible to always generate the cur- 
rent microprogram address “plus one” at the input of the micro- 
program counter register. 


In Figure 1, the microprogram address counter was described 
as a counter and could be a device such as the Am25LS161 
counter. In the implementation as shown in Figure 2, the 
Am25LS161 counter is not appropriate. Instead, an incrementer 
and register are used to give the equivalent effect of a counter. 


The key difference between using a true binary counter and the 
incrementer register described here is as follows. When the 
jump address from the pipeline register is selected by the mul- 
tiplexer, the incrementer will combinatorially prepare that ad- 
dress plus one for entry into the microprogram counter register. 
This entry will occur on the LOW-to-HIGH transition of the 
clock. Thus, the microprogram counter register can always be 
made to contain address plus one, independent of the selection 
of the next microinstruction address. When the address multi- 
plexer is switched so that the microprogram counter register is 
selected as the source of the microprogram memory address, 
the incrementer will again set-up address plus one for entry into 
the microprogram counter register. Thus, when the address 
multiplexer selects the microprogram counter register, the ad- 
dress multiplexer, incrementer and microprogram counter regis- 
ter appear to operate as a normal binary counter. 


The condition code multiplexer SoS; operates in exactly the 
same fashion as described for the condition code multiplexer of 
Figure 1. That ts, binary zero in the pipeline register (the cur- 
rent microinstruction being executed) forces an unconditional 
selection of the microprogram register via Do. Binary one or bi- 
nary two tn the next address select control bits of the pipeline 
register cause a conditional selection at the address multiplexer 
via D; or Do. Thus, a CONDITIONAL JUMP can be executed. 
Binary three in the next address select portion of the pipeline 
register causes an UNCONDITIONAL JUMP instruction to be 
executed via D3. 


When the overall machine timing is studied, it will be observed 
that the key difference between overlap fetching and non- 
overlap fetching involves the propagation delay of the micro- 
program memory. In the non-pipelined architecture, the micro- 
program memory propagation delay must be added to the 
propagation delay of all the other elements of the machine. In 
the overlap fetch architecture, the propagation delay associated 
with the next microprogram memory address fetch is a sepa- 
rate loop independent of the other portion of the machine. 


SUBROUTINING IN MICROPROGRAMMING CONTROL 


Thus far, we have examined the CONTINUE instruction as well 
as the CONDITIONAL and UNCONDITIONAL JUMP instruc- 
tions for overlap fetch. Just as in the programming of minicom- 
puters and microcomputers, the advantages of SUBROUTIN- 
ING can be realized in microprogramming. The idea here, of 
course, is that the same block of microcode (or even a single 
microinstruction) can be shared by several microinstruction 
sequences. This results in an overall reduction in the total 


number of microprogram memory words required by the de- 
sign. If we are to jump to a subroutine, what is required is the 
ability to store an address to which the subroutine should return 
when it has completed its execution. Examining the block dia- 
gram of Figure 3, we see the addition of a subroutine and loop 
(push/pop) stack (also called the file) and its associated stack 
pointer. The control signals required by the stack are an enable 
stack signal (FILE ENABLE = FE) which will be used to tell the 
fille whenever we wish to perform a push or a pop, and a 
push/pop control (PUP) used to control the direction of the 
stack pointer (push or pop). 


In this architecture, the stack pointer always points to the address 
of the last microinstruction written on the stack. This al- 
lows the “next address multiplexer’ to read the stack at any 
time via port F. When this selection is performed, the last word 
written on the stack will be the word applied to the micropro- 
gram memory. The condition code multiplexer of the previous 
example has also been replaced by a next address control unit. 
This next address control unit (Am29811A) can execute 16 dif- 
ferent next address control functions where most of these func- 
tions are conditional. Thus, the device has four instruction in- 
puts as well as one condition code test input which is con- 
nected to the condition code multiplexer. Note also that the 
next address control field of the microprogram word has been 
expanded to a four-bit field. Outputs from the Am29811A next 
address contro! block are used to control the stack pointer and 
the next address multiplexer of the Am2911. In addition, the 
device has outputs to control the three-state enable of the 
pipeline register and the three-state enable of the starting ad- 
dress decode PROM. Also, the architecture has a counter that 
can be used as a loop-counter or event counter. 


The 16 instructions associated with the Am29811A are listed in 
Table 2. As is easily seen by referring to Table 2, three of the 
instructions in this set are associated with subroutining in mi- 
croprogram memory. The first instruction of this set, is a simple 
conditional JUMP-TO-SUBROUTINE where the source of the 
subroutine address is in the pipeline register. The RETURN- 
FROM-SUBROUTINE instruction is also conditional and is used 
to return to the next microinstruction following the JUMP-TO- 
SUBROUTINE instruction. There is also a conditional JUMP- 
TO-ONE-OF-TWO-SUBROUTINES, where the subroutine ad- 
dress is either in the PIPELINE register or in the internal REG- 
ISTER tn the Am2911. This instruction will be explained in 
more detail later. 


TYPICAL COMPUTER CONTROL UNIT 
ARCHITECTURE USING THE 
Am2911 AND Am29811A 


The microprogram memory control unit block diagram of Figure 
3 Is easily implemented using the Am2911 and Am29811A. 
This architecture provides a structured state machine design 
capable of executing many highly sophisticated next address 
control instructions. The Am2911 contains a next address mul- 
tiplexer that provides four different inputs from which the ad- 
dress of the next microinstruction can be selected. These are 
the direct input (D), the register input (R), the program counter 
(PC), and the file (F). The starting address decoder (mapping 
PROM) output and the pipeline register output are connected 
together at the D input to the Am2911 and are operated in the 
three-state mode. 


The architecture of Figure 3 shows an instruction register ca- 
pable of being loaded with a machine instruction word from the 
data bus. The op code portion of the instruction is decoded 
using a mapping PROM to arrive at a starting address for the 
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TABLE 2. FUNCTIONAL DESCRIPTION OF Am29811A INSTRUCTION SET. 
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Figure 3. A Typical Computer Control Unit Using the Am2911 and Am29811A. 
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TABLE 3. PIN FUNCTIONS. 


[Aevaton [Name ow 


Direct Input Bit! 
Instruction Bit! 
Condition Code 
Condition Code Enable 


Carry-In 
Register Load 


Output Enable 


Clock Pulse 

+5 Volts 

Ground 

Microprogram Address Bit 1 
Full 

Pipeline Address Enable 


Map Address Enable 


Vector Address Enable 


Direct input to register/counter and multiplexer. Dg is LSB 
Selects one-of-sixteen instructions for the Am2910 
Used as test criterion. Pass test is a LOW on CC. 


Whenever the signal ts HIGH, CC is ignored and the part operates 
as though CC were true (LOW). 


Low order carry input to incrementer for microprogram counter 


When LOW forces loading of register/counter regardless of 
instruction or condition 


Three-state control of Y, outputs 
Triggers all internal state changes at LOW-to-HIGH edge 


Address to microprogram memory. Yg is LSB, Y17 !s MSB 
Indicates that five items are on the stack 


Can select #1 source (usually Pipeline Register) as direct 
input source 


Can select #2 source (usually Mapping PROM or PLA) as 
direct input source 


Can select #3 source (for example, Interrupt Starting Address) 


as direct input source 


microinstruction sequence required to execute the machine in- 
struction. When the microprogram memory address ts to be the 
first microinstruction of the machine instruction sequence, the 
Am29811A next address control unit selects the multiplexer D 
input and enables the three-state output from the mapping 
PROM. When the current microinstruction being executed is 
selecting the next microinstruction address as a JUMP function, 
the JUMP address will be available at the multiplexer D input. 
This is accomplished by having the Am29811A select the next 
address multiplexer D input and also enabling the three-state 
output of the pipeline register branch address field. The register 
enable input to the Am2911 is connected to ground so that this 
register will always load the value at the Am2911 D input. The 
value at D is clocked into the Am2911’s register (R)-at the end 
of the current microcycle, which makes the D value of this mi- 
crocycle available as the R value of the next microcycle. Thus, 
by using the branch address field of two sequential micro- 
instructions, a conditional JUMP-TO-ONE-OF-TWO- 
SUBROUTINES or a conditional JUMP-TO-ONE-OF-TWO- 
BRANCH-ADDRESSES can be executed by either selecting 
the D input or the R input of the next address multiplexer. 


When sequencing through continuous microinstructions in mi- 
croprogram memory, the program counter in the Am2911 is 
used. Here, the Am29811A simply selects the PC input of the 
next address multiplexer. In addition, most of these instructions 
enable the three-state outputs of the pipeline register as- 
sociated with the branch address field, which allows the register 
within the Am2911 to be loaded. 


The 4 x 4 stack in the Am2911 ts used for looping and sub- 
routining in microprogram operations. Up to four levels of sub- 
routines or loops can be nested. Also, loops and subroutines 
can be intermixed as long as the four-word depth of the stack 
is not exceeded. 


ARCHITECTURE OF THE Am2910 


The Am2910 1s a bipolar microprogram controller intended for 
use in high-speed microprocessor applications. It allows ad- 
dressing of up to 4K words of microprogram. A block diagram 
is shown tn Figure 4. 








The controller contains a four-input multiplexer that is used to 
select either the register/counter, direct input, microprogram 
counter, or stack as the source of the next microinstruction ad- 
dress. 


The register/counter consists of 12 D-type, edge-triggered flip- 
flops, with a common clock enable. When its load control, RLD, 
is LOW, new data is loaded on a positive clock transition. A 
few instructions include load; in most systems, these instruc- 
tions will be sufficient, simplifying the microcode. The output of 
the register/counter is available to the multiplexer as a source 
for the next microinstruction address. The direct input furnishes 
a source of data for loading the register/counter. 
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Figure 4. Am2910 Block Diagram. 
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The Am2910 contains a microprogram counter (wPC) that is 
composed of a 12-bit incrementer followed by a 12-bit register. 
The «PC can be used in either of two ways When the carry-in 
to the incrementer is HIGH, the microprogram register is loaded 
on the next clock cycle with the current Y output word plus one 
(Y+1—pPC). Sequential microinstructions are thus executed. 
When the carry-in is LOW, the incrementer passes the Y output 
word unmodified so that w~PC is reloaded with the same Y word 
on the next clock cycle (Y >PC). The same microinstruction 
is thus executed any number of times. 


The third source for the multiplexer is the direct (D) inputs. This 
source is used for branching. 


The fourth source available at the multiplexer input is a 5-word 
by 12-bit stack (file). The stack is used to provide return ad- 
dress linkage when executing microsubroutines or loops. The 
stack contains a build-in stack pointer (SP) which always points 
to the last file word written. This allows stack reference opera- 
tions (looping) to be performed without a pop. The stack pointer 
operates as an up/down counter. During microinstructions 2, 4 
and 5, the PUSH operation is performed. This causes the stack 
pointer to increment and the file to be written with the required 
return linkage. On the cycle following the PUSH, the return 
data is at the new location pointed to by the stack pointer. 


During six other microinstructions, a POP operation occurs. 
This places the information at the top of the stack onto the Y 
outputs. The stack pointer decrements at the next rising clock 
edge following a POP, effectively removing old information from 
the top of the stack. 


The stack pointer linkage 1s such that any sequence of pushes, 
pops or stack references can be achieved. At RESET (Instruc- 
tion 0), the depth of nesting becomes zero. For each PUSH, 
the nesting depth increases by one; for each POP, the depth 
decreases by one. The depth can grow to five. After a depth of 
five is reached, FULL goes LOW. Any further PUSHes onto a 
full stack overwrites information at the top of the stack, but 
leaves the stack pointer unchanged. This operation will usually 
destroy useful information and is normally avoided. A POP from 
an empty stack places non-meaningful data on the Y outputs, 
but is otherwise safe. The stack pointer remains at zero 
whenever a POP is attempted from a stack already empty. 


TABLE 4. Am2910 MICR 
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The register/counter is operated during three microinstructions 
(8, 9, 15) as a 12-bit down counter, with result = zero available 
as a microinstruction branch test criterion. This provides effi- 
cient iteration of microinstructions. The register/counter is ar- 
ranged such that if it is preloaded with a number N and then 
used as a loop termination counter, the sequence will be exe- 
cuted exactly N+1 times. During instruction 15, a three-way 
branch under combined contro! of the loop counter and the 
condition code is available. 


The device provides three-state Y outputs. These can be par- 
ticularly useful in designs requiring automatic checkout of the 
processor. The microprogram controller outputs can be forced 
into the high-impedance state, and pre-programmed sequences 
of microinstructions can be executed via external access to the 
address lines. 


OPERATION 


Table 4 shows the result of each instruction in controlling the 
multiplexer which determines the Y outputs, and In controlling the 
three enable signals PL, MAP and VECT. The effect on the «PC, 
the register/counter, and the stack after the next positive-going 
clock edge is also shown. The multiplexer determines which 
internal source drives the Y outputs. The value loaded into «PC is 
either identical to the Y output, or else one greater, as determined 
by Cl. For each instruction, one and only one of the three outputs 
PL, MAP and VECT is LOW. If these outputs control three-state 
enables for the primary source of microprogram jumps (usually 
part of a pipeline register), a PROM which maps the instruction to 
a microinstruction starting location, and an optional third source 
(often a vector from a DMA or interrupt source), respectively, the 
three-state sources can drive the D inputs without further logic. 








Several inputs, as shown in Table 4 can modify instruction execu- 
tion. The combination CC HIGH and CCEN LOW is used as atest 
in 10 of the 16 instructions. RLD, when LOW, causes the D input 
to be loaded into the register/counter, overriding any HOLD or 
DEC operation specified in the instruction. OE, normally LOW, 
may be forced HIGH to remove the Am2910 Y outputs from a 
three-state bus. 
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The stack, a five-word last-in, first-out 12-bit memory, has a 
pointer which addresses the value presently on the top of the 
stack. Explicit control of the stack pointer occurs during instruc- 
tion 0 (RESET), which makes the stack empty by resetting the SP 
to zero. After a RESET, and whenever else the stack is empty, the 
content of the top of stack is undefined until a PUSH occurs. Any 
POPs performed while the stack is empty put undefined data on 
the F outputs and leave the stack pointer at zero. Any time the 
stack is full (five more PUSHes than POPs have occurred since 
the stack was last empty), the FULL warning output occurs. No 
additional PUSH should be attempted onto a full stack; if tried, 
information at the top of the stack will be overwritten and lost. 


THE Am2910 INSTRUCTION SET 


The Am2910 provides 16 instructions which select the address of 
the next microinstruction to be executed. Four of the instructions 
are unconditional — their effect depends only on the instruction. 
Ten of the instructions have an effect which Is partially controlled 
by an external, data-dependent condition. Three of the instruc- 
tions have an effect which ts partially controlled by the contents of 
the internal register/counter. The instruction set is shown in Table 
4. In this discussion it is assumed that Cl is tied HIGH. 


In the ten conditional instructions, the result of the data- 
dependent tests applied to CC. If the CC input is LOW, the test is 
considered to have been passed, and the action specified in the 
name occurs; otherwise, the test has failed and an alternate 
(often simply the execution of the next sequential microinstruc- 
tion) occurs. Testing of CC may be disabled for a specific micro- 
instruction by setting CCEN HIGH, which unconditionally forces 
the action specified in the name; that ts, it forces a pass. Other 
ways of using CCEN include (1) tying it HIGH, which is useful if no 
microinstruction is data-dependent; (2) tying it LOW if data- 
dependent instructions are never forced unconditionally; or (3) 
tying it to the source of Am2910 instruction bit Ip, which leaves 
instructions 4, 6 and 10 as data-dependent but makes others 
unconditional. All of these tricks save one bit of microcode width. 


The effect of three instructions depends on the contents of the 
register/counter. Unless the counter holds a value of zero, it is 
decremented; If it does hold zero, it is held and a different micro- 
program next address is selected. These instructions are useful 
for executing a microinstruction loop a known number of times. 
Instruction 15 is affected both by the external condition code and 
the internal register/counter. 


Perhaps the best technique for understanding the Am2910 is to 
simply take each instruction and review its operation. In order to 
provide some feel for the actual execution of these instructions, 
Figure 5 is included and depicts examples of all 16 instructions. 


The examples given in Figure 5 should be interpreted in the 
following manner: The intent is to show microprogram flow as 
various microprogram memory words are executed. For exam- 
ple, the CONTINUE instruction, instruction number 14, as shown 
in Figure 5, simply means that the contents of microprogram 
memory word 50 is executed, then the contents of word 51 is 
executed. This is followed by the contents of microprogram 
memory word 52 and the contents of microprogram memory word 
53. The microprogram addresses used in the examples were 
arbitrarily chosen and have no meaning other than to show in- 
struction flow. The exception to this is the first example, JUMP 
ZERO, which forces the microprogram location counter to ad- 
dress ZERO. Each dot refers to the time that the contents of the 
microprogram memory word is in the pipeline register. While no 
special symbology is used for the conditional instructions, the text 
to follow will explain what the conditional choices are in each 
example. 


It might be appropriate at this time to mention that AMD has a 
microprogram assembler called AMDASM, which has the capa- 
bility of using the Am2910 instructions in symbolic representa- 
tion. AMDASM’s Am2910 instruction symbolics (or mnemonics) 
are given in Figure 5 for each instruction and are also shown in 
Table 4. 


Instruction 0, JZ (JUMP and ZERO, or RESET) unconditionally 
specifies that the address of the next microinstruction Is zero. 
Many designs use this feature for power-up Sequences and pro- 
vide the power-up firmware beginning at microprogram memory 
word location 0. 


Instruction 1 is a CONDITIONAL JUMP-TO-SUBROUTINE via 
the address provided tn the pipeline register. As shown in Figure 
5, the machine might have executed words at address 50, 51 and 
52. When the contents of address 52 Is in the pipeline register, the 
next address control function is the CONDITIONAL JUMP-TO- 
SUBROUTINE. Here, if the test is passed, the next instruction 
executed will be the contents of microprogram memory location 
90. If the test failed, the JUMP-TO-SUBROUTINE will not be 
executed; the contents of microprogram memory location 53 will 
be executed instead. Thus, the CONDITIONAL JUMP-TO- 
SUBROUTINE instruction at location 52 will cause the instruction 
either in location 90 or in location 53 to be executed next. If the 
TEST input is such that location 90 is selected, value 53 will be 
pushed onto the internal stack. This provides the return linkage 
for the machine when the subroutine beginning at location 90 Is 
completed. In this example, the subroutine was completed at 
location 93 and a RETURN-FROM-SUBROUTINE would be 
found at location 93. 


Instruction 2 is the JUMP MAP instruction. This is an uncondi- 
tional instruction which causes the MAP output to be enabled so 
that the next microinstruction location is determined by the ad- 
dress supplied via the mapping PROMs. Normally the JUMP 
MAP instruction is used at the end of the instruction fetch se- 
quence for the machine. In the example of Figure 5, microinstruc- 
tions at locations 50, 51, 52 and 53 might have been the fetch 
sequence and at its completion at location 53, the jump map 
function would be contained in the pipeline register. This example 
shows the mapping PROM outputs to be 90; therefore, an uncon- 
ditional jump to microprogram memory address 90 is performed. 


Instruction 3, CONDITIONAL JUMP PIPELINE, derives Its 
branch address from the pipeline register branch address value 
(BRo-BR;, in Figure 6). This instruction provides a technique for 
branching to various microprogram sequences depending upon 
the test condition inputs. Quite often, state machines are de- 
signed which simply execute tests on various inputs waiting for 
the condition to come true. When the true condition is reached, 
the machine then branches and executes a set of microinstruc- 
tions to perform some function. This usually has the effect of 
resetting the input being tested unti!] some point in the future. 
Figure 5 shows the conditional jump via the pipeline register 
address at location 52. When the contents of microprogram 
memory word 52 are in the pipeline register, the next address will 
be either location 53 or location 30 in this example. If the test is 
passed, the value currently in the pipeline register (3) will be 
selected. If the test fails, the next address selected will be con- 
tained in the microprogram counter which, in this example, is 53. 


Instruction 4 is the PUSH/CONDITIONAL LOAD COUNTER in- 
struction and is used primarily for setting up loops in micropro- 
gram firmware. In Figure 5, when instruction 52 is in the pipeline 
register, a PUSH will be made onto the stack and the counter will 
be loaded based on the condition. When a PUSH occurs, the 
value pushed is always the next sequential instruction address. In 
this case, the address is 53. If the test fails, the counter is not 
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loaded; if it is passed, the counter is loaded with the value con- 
tained in the pipeline register branch address field. Thus, a single 
microinstruction can be used to set up a loop to be executed a 
specific number of times. Instruction 8 will describe how to use 
the pushed value and the register/counter for looping. 


Instruction 5 is a CONDITIONAL JUMP-TO-SUBROUTINE via 
the register/counter or the contents of the PIPELINE register. As 
shown in Figure 5, a PUSH Is always performed and one of two 
subroutines executed. In this example, either the subroutine be- 
ginning at address 80 or the subroutine beginning at address 90 
will be performed. A return-from-subroutine (instruction number 
10) returns the microprogram flow to address 55. In order for this 
microinstruction control sequence to operate correctly, both the 
next address fields of instruction 53 and the next address fields of 
instruction 54 would have to contain the proper value. Let’s 
assume that the branch address fields of instruction 53 contain 
the value 90 so that it will be in the Am2910 register/counter when 
the contents of address 54 are in the pipeline register. This 
requires that instruction at address 53 load the register/counter. 
Now, during the execution of instruction 5 (at address 54), if the 
test failed, the contents of the register (value = 90) will select the 
address of the next microinstruction. If the test input passes, the 
pipeline register contents (value = 80) will determine the address 
of the next microinstruction. Therefore, this instruction provides 
the ability to select one of two subroutines to be executed based 
on a test condition. 


Instruction 6 is a CONDITIONAL JUMP VECTOR instruction 
which provides the capability to take the branch address from a 
third source heretofore not discussed. In order for this instruction 
to be useful, the Am2910 output, VECT, is used to control a 
three-state control input of a register, buffer, or PROM containing 
the next microprogram address. This instruction provides one 
technique for performing interrupt type branching at the micro- 
program level. Since this instruction is conditional, a pass causes 
the next address to be taken from the vector source, while failure 
causes the next address to be taken from the microprogram 
counter. In the example of Figure 5, if the CONDITIONAL JUMP 
VECTOR instruction is contained at location 52, execution will 
continue at vector address 20 if the TEST input is HIGH and the 
microinstruction at address 53 will be executed if the TEST input 
is LOW. 


Instruction 7 is a CONDITIONAL JUMP via the contents of the 
Am2910 REGISTER/COUNTER or the contents of the PIPELINE 
register. This instruction ts very similar to instruction 5; the condi- 
tional jump-to-subroutine via R or PL. The major difference be- 
tween instruction 5 and instruction 7 is that no push onto the stack 
is performed with 7. Figure 5 depicts this instruction as a branch 
to one of two locations depending on the test condition. The 
example assumes the pipeline register contains the value 70 
when the contents of address 52 is being executed. As the 
contents of address 53 is clocked into the pipeline register, the 
value 70 is loaded into the register/counter in the Am2910. The 
value 80 is available when the contents of address 53 is in the 
pipeline register Thus, control is transferred to either address 70 
or address 80 depending on the test condition. 


Instruction 8 is the REPEAT LOOP, COUNTER # ZERO instruc- 
tion. This microinstruction makes use of the decrementing capa- 
bility of the register/counter. To be useful, some previous instruc- 
tion, such as 4, must have loaded a count value into the register/ 
counter. This instruction checks to see whether the register/ 
counter contains a non-zero value. If so, the register/counter is 
decremented, and the address of the next microinstruction is 
taken from the top of the stack. If the register counter contains 
zero, the loop exit condition is occurring; control falls through to 


the next sequential microinstruction by selecting uPC; the stack 
is POP’d by decrementing the stack pointer, but the contents of 
the top of the stack are thrown away. 


An example of the REPEAT LOOP, COUNTER # ZERO instruc- 
tion is shown in Figure 5. In this example, location 50 most likely 
would contain a PUSH/CONDITIONAL LOAD COUNTER in- 
struction which would have caused address 51 to be PUSHed on 
the stack and the counter to be loaded with the proper value for 
looping the desired number of times. 


In this example, since the loop test is made at the end of the 
instructions to be repeated (microaddress 54), the proper value to 
be loaded by the instruction at address 50 is one less than the 
desired number of passes through the loop. This method allows a 
loop to be executed from 0 to 4095 times. 


Single-microinstruction loops provide a highly efficient capability 
for executing a specific microinstruction a fixed number of times. 
Examples include fixed rotates, byte swap, fixed point multiply, 
and fixed point divide. 


Instruction 9 is the REPEAT PIPELINE REGISTER, COUNTER 
# ZERO instruction. This instruction is similar to instruction 8 
except that the branch address now comes from the pipeline 
register rather than the file. In some cases, this instruction may be 
thought of as a one-word file extension; that is, by using this 
instruction, a loop with the counter can still be performed when 
subroutines are nested five deep. This instruction’s operation is 
very similar to that of instruction 8. The differences are that on this 
instruction, a failed test condition causes the source of the next 
microinstruction address to be the D inputs; and, when the test 
condition is passed, this instruction does not perform a POP 
because the stack is not being used. 


In the example of Figure 5, the REPEAT PIPELINE, COUNTER 
# ZERO instruction ts instruction 52 and is shown as a single 
microinstruction loop. The address in the pipeline register would 
be 52. Instruction 51 in this example could be the LOAD 
COUNTER AND CONTINUE instruction (number 12). While the 
example shows a single microinstruction loop, by simply chang- 
ing the address in a pipeline register, multi-instruction loops can 
be performed in this manner for a fixed number of times as 
determined by the counter. 


Instruction 10 is the conditional RETURN-FROM-SUBROUTINE 
instruction. As the name implies, this instruction is used to branch 
from the subroutine back to the next microinstruction address 
following the subroutine call. Since this instruction is conditional, 
the return is performed only if the test is passed. If the test ts 
failed, the next sequential microinstruction is performed. The 
example in Figure 5 depicts the use of the conditional RETURN- 
FROM-SUBROUTINE instruction in both the conditional and the 
unconditional modes. This example first shows a jump-to- 
subroutine at instruction location 52 where control ts transferred 
to location 90. At location 93, a conditional RETURN-FROM- 
SUBROUTINE instruction ts performed. If the test is passed, the 
stack is accessed and the program will transfer to the next instruc- 
tion at address 53. If the test is failed, the next microinstruction at 
address 94 will be executed. The program will continue to ad- 
dress 97 where the subroutine is complete. To perform an un- 
conditional RETURN-FROM-SUBROUTINE, the conditional 
RETURN-FROM-SUBROUTINE instruction is executed uncon- 
ditionally; the microinstruction at address 97 is programmed to 
force CCEN HIGH, disabling the test and the forced PASS 
causes an unconditional return. 


Instruction 11 is the CONDITIONAL JUMP PIPELINE register 
address and POP stack instruction. This instruction provides 
another technique for loop termination and stack maintenance. 
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The example in Figure 5 shows a loop being performed from 
address 55 back to address 51. The instructions at locations 52, 
53 and 54 are all conditional JUMP and POP instructions. At 
address 52, if the TEST input is passed, a branch will be made to 
address 70 and the stack will be properly maintained via a POP. 
Should the test fail, the instruction at location 53 (the next sequen- 
tial instruction) will be executed. Likewise, at address 53, either 
the instruction at 90 or 54 will be subsequently executed, respec- 
tive to the test being passed or failed. The instruction at 54 follows 
the same rules, going to either 80 or 55. An instruction sequence 
as described here, using the CONDITIONAL JUMP PIPELINE 
and POP instruction, is very useful when several inputs are being 
tested and the microprogram ts looping waiting for any of the 
inputs being tested to occur before proceeding to another se- 
quence of instructions. This provides the powerful jump-table 
programming technique at the firmware level. 


Instruction 12 is the LOAD COUNTER AND CONTINUE instruc- 
tion, which simply enables the counter to be loaded with the value 
at its parallel inputs. These inputs are normally connected to the 
pipeline branch address field which (in the architecture being 
described here) serves to supply either a branch address or a 
counter value depending upon the microinstruction being exe- 
cuted. There are altogether three ways of loading the counter — 
the explicit load by this instruction 12; the conditional load in- 
cluded as part of instruction 4; and the use of the RLD input along 
with any instruction. The use of RLD with any instruction over- 
rides any counting or decrementation specified in the instruction, 
calling for a load instead. Its use provides additional microinstruc- 
tion power, at the expense of one bit of microinstruction width. 
This instruction 12 is exactly equivalent to the combination of 
instruction 14 and RLD LOW. Its purpose is to provide a simple 
capability to load the register/counter in those implementations 
which do not provide microprogrammed control for RLD. 


Instruction 13 is the TEST END-OF-LOOP instruction, which 
provides the capability of conditionally exiting a loop at the bot- 
tom; that is, this is a conditional instruction that will cause the 
microprogram to loop, via the file, if the test is failed else to 
continue to the next sequential instruction. The example in Figure 
5 shows the TEST END-OF-LOOP microinstruction at address 
56. If the test fails, the microprogram will branch to address 52. 
Address 52 is on the stack because a PUSH instruction had been 
executed at address 51. If the test is passed at instruction 56, the 
loop is terminated and the next sequential microinstruction at 
address 57 is being executed, which also causes the stack to be 
POPd; thus, accomplishing the required stack maintenance. 


Instruction 14 is the CONTINUE instruction, which simply causes 
the microprogram counter to increment so that the next sequen- 
tial microinstruction is executed. This is the simplest microinstruc- 
tion of all and should be the default instruction which the firmware 
requests whenever there is nothing better to do. 


Instruction 15, THREE-WAY BRANCH, is the most complex. It 
provides for testing of both a data-dependent condition and the 
counter during one microinstruction and provides for selecting 
among one of three microinstruction addresses as the next mi- 
croinstruction to be performed. Like instruction 8, a previous 
instruction will have loaded a count into the register/counter while 
pushing a microbranch address onto the stack. Instruction 15 
performs a decrement-and-branch-until-zero function similar to 
instruction 8. The next address !s taken from the top of the stack 
until the count reaches zero; then the next address comes from 
the pipeline register. The above action continues as long as the 
test condition fails. If at any execution of instruction 15 the test 
condition is passed, no branch is taken; the microprogram 
counter register furnishes the next address. When the loop ts 


ended, either by the count becoming zero, or by passing the 
conditional test, the stack is POP’d by decrementing the stack 
pointer, since interest in the value contained at the top of the stack 
is then complete. 


The application of instruction 15 can enhance performance of a 
variety of machine-level instructions. For instance, (1) a memory 
search instruction to be terminated either by finding a desired 
memory content or by reaching the search limit; (2) variable- 
field-length arithmetic terminated early upon finding that the con- 
tent of the portion of the field still unprocessed ts all zeroes; (3) 
key search in a disc controller processing variable length records; 
(4) normalization of a floating point number. 


As one example, consider the case of a memory search instruc- 
tion. As shown in Figure 5, the instruction at microprogram ad- 
dress 63 can be Instruction 4 (PUSH), which will push the value 
64 onto the microprogram stack and load the number N, which is 
one less than the number of memory locations to be searched 
before giving up. Location 64 contains a microinstruction which 
fetches the next operand from the memory area to be searched 
and compares it with the search key. Location 65 contains a 
microinstruction which tests the result of the comparison and also 
is a THREE-WAY BRANCH for microprogram control. If no match 
is found, the test fails and the microprogram goes back to location 
64 for the next operand address. When the count becomes zero, 
the microprogram branches to location 72, which does whatever 
is necessary if no match is found. If a match occurs on any 
execution of the THREE-WAY BRANCH at location 65, control 
falls through to location 66 which handles this case. Whether the 
instruction ends by finding a match or not, the stack will have 
been POP’d once, removing the value 64 from the top of the 
stack. 


Am29811A instruction Set Difference 


The Am29811A instruction set is identical to the Am2910 except 
for instruction number 15. In the Am29811A, instruction number 
15 is an unconditional JUMP PIPELINE REGISTER instruction. 
This provides the ability to unconditionally branch to any address 
contained in the branch address field of the microprogram. Thus, 
an unconditional N-way branch can be performed. Use of this 
instruction as opposed to a forced conditional jump pipeline in- 
struction simply allows the condition code multiplexer select field 
to be shared (formatted) with other functions. 


TYPICAL COMPUTER CONTROL UNIT ARCHITECTURE 
USING THE Am2910 


The microprogram memory control unit block diagram of Figure 6 
is easily implemented using the Am2910. This architecture pro- 
vides a structured state machine design capable of executing 
many highly sophisticated next address control instructions. 


-The architecture of Figure 6 shows an instruction register capable 


of being loaded with a machine instruction word from the data 
bus. The op code portion of the instruction is decoded using a 
mapping PROM to arrive at a starting address for the mi- 
croinstruction sequence required to execute the machine instruc- 
tion. When the microprogram memory address is to be the first 
microinstruction of the machine instruction sequence, the 
Am2910 next address control selects the multiplexer D input and 
enables the three-state output from the mapping PROM. When 
the current microinstruction being executed is selecting the next 
microinstruction address as a JUMP function, the JUMP address 
will be available at the multiplexer D input. This is accomplished 
by having the Am2910 select the next address multiplexer D input 
and also enabling the three-state output of the pipeline register 
branch address field. The register enable input to the Am2910 
can be grounded so that this register will load the value at the 


Am2910 D input. The value at D is clocked into the Am2910’s 
register (R) at the end of the current microcycle, which makes the 
D value of this microcycle available as the R value of the next 
microcycle. Thus, by using the branch address field of two se- 
quential microinstructions, a conditional JUMP-TO-ONE-OF- 
TWO-SUBROUTINES or a conditional JUMP-TO-ONE-OF- 
TWO-BRANCH-ADDRESSES can be executed by either se- 
lecting the D input or the R input of the next address multiplexer. 


When sequencing through continuous microinstructions in mi- 
croprogram memory, the program counter in the Am2910 is used. 
Here, the control logic simply selects the PC input of the next 
address multiplexer. In addition, most of these instructions ena- 
ble the three-state outputs of the pipeline register associated with 
the branch address field, which allows the register within the 
Am2910 to be loaded. The 5 x 12 stack in the Am2910 1s used for 


looping and subroutining in microprogram operations. Up to five 
levels of subroutines or loops can be nested. Also, loops and 
subroutines can be intermixed as long as the five word depth of 
the stack is not exceeded. 


CCU TIMING 


The minimum clock cycle that can be used in a CCU design is 
usually determined by the component delays along the longest 
“pipeline-register-clock to logic to pipeline-register-clock” path. 
At the beginning of any given clock cycle, data available at the 
output of the microprogram memory, counter status, and any 
other data and/or status fields, are latched into their associated 
pipeline registers. At this point, all delay paths begin. Visual 
inspection will not always point out the longest signal delay path. 
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Figure 6. A Typical Computer Control Unit Using the Am2910. 
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The obviously long paths are a good place to start, but each 
definable path should be calculated on a component by compo- 
nent basis until the truly longest logic signal path is found. 


Referring to Figure 6, a number of potentially long paths can be 
identified. These include the instruction register to pipeline regis- 
ter time, the pipeline register to pipeline register time via the 
condition code multiplexer and the status to pipeline register time. 
In order to demonstrate the technique for calculating the AC 
performance of the Am2910 state machine design, the timing 
diagrams of Figure 7 are presented. Here, a number of propaga- 
tion delay paths are evaluated such that the reader can learn the 
technique for performing these computations. 


All of the propagation delays have been calculated using typical 
propagation delays because at the time of this writing, the charac- 
terization of the Am2910 has not been completed. When the final 
data sheet is published, the user need only select the appropriate 
worst case specifications and he can compute the desired 
maximum propagation delays for his design. Also, by looking at 
the typical propagation delay numbers, the designer will be able 
to evaluate the design margin in the system after he has com- 
pleted all of the worst case calculations. These typical prop- 
agation delays represent the expected values if a system were 
set up on the bench and actual measurements would be taken at 
5V and 25°C operating temperature. 


While Figure 6 and Figure 7 deal with the Am2910 microprogram 
sequencer, itis also instructive to evaluate the AC performance of 
a typical computer control unit using the Am2911 and 
Am29811A. Figure 3 shows such a connection and will be used 
as the basis for performing the propagation delay path calcula- 
tions. The calculations for the various propagation delay paths 
are demonstrated in Figure 8 and are intended to show the 
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technique for computing these delays. As before, the typical 
propagation delays have been used In the computation for com- 
parison purposes. The user can derive the maximum numbers at 
25°C and 5V, commercial temperature range and power supply 
variations or military temperature range and power supply varia- 
tions as required for his design. 


When Figure 7 and Figure 8 are reviewed in detail, the reader will 
recognize that the longest propagation delay paths in the case of 
the Am2910 as well as the Am2911 and Am29811A involve the 
three-state enables on the map PROM or the pipeline register for 
the branch address. If absolute maximum speed is desired, these 
paths can be eliminated by using one of several techniques. One 
technique is to simply allocate one or more bits in the pipeline 
register to control the three-state enables of the various devices 
connected to the D input of the Am2910. For the example of 
Figure 6, one bit would be sufficient and the pipeline register 
could be implemented using an Am74S175 register. This would 
allow the true and complement outputs to be used to drive the 
pipeline register branch address output enable and the mapping 
PROM output enable. Thus, these longest paths would be elimi- 
nated and an improvement of about 30ns would be achieved. A 
second technique for eliminating these propagation delay paths 
would be to use a four input NAND gate and a four input NOR gate 
to encode the equivalent function of the MAP enable and the PL 
enable. This technique is demonstrated in Figure 9. Again, an 
Am74S175 register would be used as the pipeline register to 
provide the instruction inputs to the Am2910 sequencer. This 
would allow instruction 2 to be decoded to provide the MAP 
enable signal and “NOT INSTRUCTION 2” to be decoded as the 
pipeline enable signal. This technique can be applied as well to 
the computer contro! unit of Figure 3 to accomplish the same 
longest path elimination. 
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Figure 7. Propagation Delay Calculations on the Am2910 Microprogram Sequencer. 
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Figure 7. Propagation Delay Calculations on the Am2910 Microprogram Sequencer (Cont.). 
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Figure 7. Propagation Delay Calculations on the Am2910 Microprogram Sequencer (Cont.). 
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Figure 8. Propagation Delay Calculations for the Am2911 and Am29811A Design. 
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Figure 8. Propagation Delay Calculations for the Am2911 and Am29811A Design (Cont.). 
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Figure 8. Propagation Delay Calculations for the Am2911 and Am29811A Design (Cont.). 
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Figure 8. Propagation Delay Calculations for the Am2911 and Am29811A Design (Cont.). 
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Figure 8. Propagation Delay Calculations for the Am2911 and Am29811A Design (Cont.). 
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Figure 9. Using NAND and NOR Gates to Improve Am2910 Speed. 


In order to compare the performance of the Am2910 with the 
Am2911 and Am29811A, Table 5 ts presented. Here the prop- 
agation delays for the Am2911 and Am29811A are for a 12-bit 
wide microprogram sequencer configuration. If a wider configu- 
ration is used, only one additional carry input to carry output delay 
must be added to the appropriate paths of these calculations. A 
12-bit wide Am2911/29811A configuration has been evaluated 
so that an “apples to apples” comparison can be made. 


As Is shown in Table 5, anumber of combinations are possible for 
the longest AC propagation delay paths for these microprogram 
sequencers. First, the continue instruction can be executed the 
fastest of any of the microprogram instructions if the continues 
are sequential. That 1s, from the second continue on, the typical 
microcycle can be either 61 or 64ns respectively. To achieve this 
speed, it is required that various signals throughout the architec- 
ture be stable such that the only paths that enter into the propaga- 
tion delay calculation are the clock-to-output of the microprogram 
counter, the microprogram memory and the pipeline register 
setup. 


The second group of instructions shown in Table 5 show some 
examples of instruction execution and jumping. These examples 
assume that the MAP and OE outputs are not used as described 
earlier. These calculations apply to several of the instructions but 
not to all the instructions. For the Am2910 sequencer ail of the 
propagation delays are around 80 to 85ns; while for the 
Am2911/Am29811A combination, the propagation delays range 
from about 80ns to 100ns, depending on the instruction. It should 
be noted that certain other instructions such as push and condi- 
tional load counter should be evaluated to determine the speed at 
which they can be executed. 


The last two instructions shown in Table 5 are for jumps where the 
output enable of the field supplying the address to the D inputs of 
the microprogram sequencers are controlled by either the 
Am2910 or Am2981 1A. Notice that for Am2910 configuration, the 
jump map represents the longest propagation delay path and is 
103ns typical. Also, for the Am291 1/Am29811A combination, the 
jump map instruction also represents the longest propagation 
delay path and is 109ns typical. ; 


It is not the purpose of this exercise to show every possible 
propagation delay path; but rather, to show the reader the 
technique for computing propagation delays such that any design 
can be evaluated and the worst case past derived. Even here, not 
all of the worst case numbers shown in Table 5 have been derived 
in Figures 7 and 8. This was done intentionally and ts left as an 
exercise for the student. 


If the Am2909 or Am2911 and the Am29811A are combined into 
microprogram sequencers of either 8 bits in width or 16 bits in 
width, the calculations need only be modified slightly to determine 


the microcycle times. Obviously, if two Am2911s are used, the 
worst case propagation delay paths do not change. However, if 
four Am2911s are used, the carry path will become the longer 
propagation delay path on several of the computations. This may 
be offset however since larger microprogram PROMs may be 
used if 64K of microcode is actually being addressed or high 
power buffers may be placed between the Am2911 outputs and 
the microprogram memory to provide sufficient drive for such a 
large microprogram store. 


In addition, the Am2909 and Am2911 may be used without the 
Am29811A where the user wishes to generate a special purpose 
instruction set or very high speed control of the internal multi- 
plexer and push pop stack. In some, designs as much as 25 to 
30ns, typical, can be removed from the longest propagation delay 
paths of the design by using high speed Schottky SSI. While this 
has not been the typical case, some designers have used it to 
provide a performance improvement not achievable with a stan- 
dard Schottky condition code multiplexer and the Am29811A 
next address control unit. 


APPLICATIONS 


It should be understood that the microprogram state machine 
built using either the Am2910 or the Am291 1/2981 1A represents 
a general purpose state machine controller. Applications for this 
type of microprogrammed control include uses in minicomputers, 
communications, instrumentation, controllers and peripherals as 
well as special purpose processors. Typically, the micropro- 
grammed approach provides a more structured organization to 
the design and allows the design engineer the greatest flexibility 
in implementation. 


It is important to understand that microprogrammed machines 
need not be part of a typical minicomputer type structure. That ts, 
a general purpose minicomputer usually has a machine instruc- 
tion set that is totally different from its microprogram instruction 
control. As such, it is essential that the designer new to computer 
design and microprogram design understand the difference be- 
tween a machine instruction and a microprogram instruction. This 
differentiation is shown in Figure 10 where a typical 16-bit 
machine level instruction is demonstrated as compared with a 
typical microprogram instruction. The machine level instruction 
usually consists of 16 bits and in this example, these bits are used 
to provide the op code, source register definition and destination 
register definition. The microprogram instruction on the other 
hand usually consists of anywhere from 32 to 128 bits in a typical 
minicomputer type design. Here, the bits are used to control the 
elemental functions of a machine such as the Am2910 instruction 
control and condition code multiplexer, the Am2903 source, ALU 
function and destination control and so forth. For purposes of this 
explanation, let us assume that the machine level instruction is 
available to the machine programmer while the microprogram 


TABLE 5. SUMMARY OF LONGEST AC PATHS FOR MICROPROGRAM SEQUENCERS. 


Am2911 
Am29811A 
Continue The fastest instruction. 
Assumes sequential continues! 


Instruction Execute 84 88 If the MAP and PL outputs 
Jump Map (no OE) 83 78 are not used. 

Jump PL (No OE) 78 101 

Jump Map (via OE) 103 109 If the MAP and PL outputs 
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Figure 10. Understanding Machine and Microprogram Instructions. 


instruction is not available to the machine programmer at the 
assembly language level. Let it suffice to say that this assumption 
is not necessarily valid in machines being designed today. 


Perhaps one of the most typical applications of the micropro- 
grammed computer control unit state machine design is as the 
controller for a minicomputer. Here, the function of the micropro- 
grammed controller is to fetch and execute machine level instruc- 
tions. The flow required to perform this function 1s depicted in 
Figure 11 which should be representative for all general purpose 
type machines. Figure 11 shows that after initialization, the com- 
puter control unit simply fetches machine instructions, decodes 
these instructions and then fetches the required operands such 
that the original instruction can be executed. This cycle of fetch- 
ing and executing instructions is performed without end. Such 
things as hardware halts or resets are ignored and should be 
assumed to only cause re-initialization. 


Once the flow of a typical computer control unit is understood, itis 
possible to evaluate a number of architectures using the Am2910 
or Am2911/Am29811A such that the flow diagram of Figure 11 
can be implemented. 


STATE MACHINE ARCHITECTURES 


After a machine instruction is fetched from memory, It is normally 
placed in the machine instruction register as described in Figure 
6. Then the op code portion of the instruction is decoded so that a 
sequence of microinstructions in the microprogram memory can 
be selected for execution. Each microinstruction is fetched and its 
contents placed in the pipeline register as shown in Figure 6 for 
execution. 


While the architecture of Figure 6 is recommended and has been 
used throughout the preceding portion of this chapter, it should be 
understood that a number of architectures are possible using 
these microprogram sequencers. The normal flow in fetching 
microinstructions is to determine the address of the next mi- 
croinstruction, fetch the contents at that address and set up this 
data at the input of the pipeline register such that it can be clocked 
into the pipeline register for execution. If we assume that a clock 
Is being used to clock the pipeline register, the Am2910, the 
machine instruction register and the Am2903 microprocessor bit 
slices, it is possible to define a number of computer control unit 
designs where the relationship between the clock edges is dif- 
ferent. 


There seem to be a minimum of seven different architectures that 
can be defined based on placing registers in the appropriate 
signal paths and storing data on the low-to-high transition of the 
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Figure 11. Computer Control Flow Diagram. 


clock. For purposes of this discussion, we will assume that all 
clocked devices will operate using the same clock such that 
changes will occur on the LOW-to-HIGH transition of the clock. 
While it is possible to use multiphase clocks and tie different clock 
phases to different devices, that type of system operation will not 
be described here. In all cases, we will be talking about the flow of 
signals between LOW-to-HIGH transitions of the clock. Typically, 
acycle is started by a clock edge at a device and the signals begin 
to flow from one device to the next until a set-up time to a clock 
edge results. Then, the next microinstruction is executed in 


exactly the same manner. There are three different identifiable 
types of microinstruction sequences where only one register is in 
the signal flow loop. The first of these we shall call an Address- 
Based microinstruction cycle. It usually starts with the address of 
a microprogram memory word being stored in a register by the 
clock. This address has been determined by the previous mi- 
croinstruction. This address then accesses the microprogram 
memory to fetch its contents which are presented at its outputs to 
control the Arithmetic Logic Unit and the results of the Arithmetic 
Logic Unit function may be used to determine the next address 
selected that will be stored in this microprogram address register. 
This is shown as Figure 12a. The second type of microprogram 
architecture Is called Instruction-Based. Here, the register is 
placed at the output of the microprogram memory as shown in 
Figure 12b. Again, the cycle consists of executing the mi- 
croinstruction in the ALU; perhaps using the results of the opera- 
tion to determine the address of the next microinstruction and 
then fetching the contents of that microinstruction and setting this 
new data up at the input to the register. The third basic architec- 
ture for microprogram control is called Data-Based. Here, a regis- 
ter is used to hold the status data from the ALU and this is the 
determining clock point for the cycle. Here, the status register 
initiates the selection of the next address from which the micro- 
programmed data is fetched and this microprogram instruction is 
used to execute a new function in the ALU thereby setting up the 
results for the status register. This scheme is shown in Figure 
12c. Note that this scheme requires an additional register at the 
output of the microprogram memory to hold a portion of the 
microprogram instruction for controlling the condition code mul- 
tiplexer and Am2910 instruction set. These primitive architec- 
tures for microprogrammed control demonstrate the three points 
at which a register can be placed to provide a start and an end for 
the microcycle. In a general sense, each of these three architec- 
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tures is one level pipelined. This, however, is not the definition 
normally associated with pipelining of microprogram control. 


If combinations of the above described architectures are im- 
plemented, an improvement in performance will be realized. In 
each of the three architectures thus described (address-based, 
instruction-based, and data-based), ail of the signal paths are in 
series and must be transcended before a microcycle can be 
completed. They are quite easy to program, however, since all of 
the tasks are completed in the loop before proceeding to the next 
microinstruction. As stated earlier, these tend to be the slowest of 
the possible architectures for microprogram control. This disad- 
vantage can be overcome by using a technique referred to as 
pipelining in microprogram control. In a pipeline architecture, we 
overlap the fetch of the next microinstruction while we are exe- 
cuting the current microinstruction. This is achieved by inserting 
additional registers in the overall path such that we can hold the 
signals step-by-step. There are three possible combinations of 
the above mentioned architectures that can be utilized in micro- 
program control. These are address-instruction-based, 
address-data-based, and instruction-data-based. While each of 
these represent two stages of pipelining, we normally refer to 
these as the pipelined architectures. These are shown in Figure 
12d, 12e and 12f. It is the instruction-data based architecture that 
is recommended for the Am2910 and provides the overall best 
trade-off in cost versus performance. 


The last possible architecture using registers in the signal path is 
a combination of all three architectures and is called address- 
instruction-data-based microprogram control and ts shown in 
Figure 12g. Here, three stages of pipeline are involved and we 
normally refer to this as two-level pipelined archiecture. Needless 
to say, if no pipelining were involved at all, we would have a ring 
oscillator. 


(b) Instruction Based 
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Figure 12. Standard Microprogram Control Architectures. 
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Figure 12. Standard Microprogram Control Architectures (Cont.). 
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(g) Instruction-Address-Data Based 


MAP 





MICROPROGRAM 
MEMORY 


Be 
2 






CLOCK 









PIPELINE ~ - 3é, Bee 
REGISTER HR em 





MPR-485 


Shaded Lines Show Required Signal Flow to Complete a Microcycle: 
Determine Address, Fetch Instruction and Execute. 


Figure 12. Standard Microprogram Control Architectures (Cont.). 


The advantage of the instruction-data-based architecture is that 
the address and contents of the next microinstruction are being 
fetched while the current microinstruction tn the pipeline register 
(Figure 6) is being executed. This allows a shorter microcycle 
since the microprogram memory fetch and ALU execution can be 
operated tn parallel. The results of this type operation are dem- 
onstrated in Figure 13 where we see a typical timing diagram of 
the microprogram execution of the address-data-based instruc- 
tion architecture. It should be noted that when the computational 
aspects of a microinstruction are not completed in the same 
microcycle, they obviously cannot be used to determine the ad- 
dress of another microcycle until the computation has been com- 
pleted and stored in the status register. Thus, this pipelined 
architecture offers significant’speed improvement except in the 
case of certain conditional jumps. In other words, the conditional 
jump may not use the status register information of the im- 
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mediately preceding microinstruction because the computation ts 
just being performed. For this architecture, the conditional jump 
fetch must be executed on the cycle after the status register 
contains the proper execution results. This can be seen by study- 
ing Figure 13. In most microprogram designs this is not a disad- 
vantage because other housekeeping and ALU operations can 
be performed while the address of the next microinstruction is 
being determined using the current contents of the status regis- 
ter. ‘While it is not directly pertinent to the discussion at this time, 
let us point out that the Am2904 has been designed such that the 
machine architect can utilize both instruction-data-based ar- 
chitecture as well as instruction-based architecture If no house- 
keeping is required. Thus, the Am2910 and Am2904 can be used 
in a variable architecture cycle to achieve maximum performance 
for the machine. 


CLOCK _| LI LJ LI LI LI 


FETCH FETCH 
MEMORY y-INST 1 u-INST 1+1 


PIPELINE REG u-INST 1-1 w-INST 1 


EXECUTE EXECUTE 


a u-INST 1—1 p-INST 1 


ACCUMULATOR & 
STATUS REG 


RESULT OF 
p-INST 1-2 


RESULT OF 
y-INST 1—1 





FETCH FETCH FETCH 
weINST 1+2 u-INST 14+3 H-INST 1+4 


p-INST 144 


p-INST 1+2 INST 1+3 


> 


EXECUTE 
peINST 1+1 


EXECUTE 
p-INST 1+ 3 


EXECUTE 
u-INST 1+2 


RESULT OF 
peINST t+2 


RESULT OF RESULT OF 
p-INST I z-INST 1+ 1 


MPR-486 


Figure 13. Timing Diagram of Microprogram Execution. 
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The Am2910 in Computer Control 


A general state machine design using the Am2910 is shown in 
Figure 14. Here, all three output enables are used to advantage in 
order to control the mapping PROM, pipeline register and vector 
PROM in this design. This design is very straightforward and in 
fact is identical to that shown earlier. 


One area that should not be overlooked is that of initializing the amazon 


Am2910 at power up. One technique for accomplishing this is to 
use a pipeline register with a clear input to provide all LOWs to the 
instruction inputs of the Am2910. This will cause a reset of the 
stack in the Am2910 and force the outputs to the zero word and 
microcode which can be used for the initialization routine. Typt- 
cally, power up will result in the firing of a timer which can be 
connected to the clear input of the register. Figure 15 shows the 
technique for initializing the Am2910 using this method. 


One advantage of the Am2909 when compared to either the 
Am2910 or Am2911 is the OR inputs to the microprogram ad- 
dress field. These OR inputs allow two, four, eight or 16-way ——$—$—$—-—_—_— 
branching for each device if proper control is used. This control ee er 
can be accomplished using the Am29803A, 16-way branch con- 

trol unit. A typical computer control unit using the Am2909, 
Am2911, Am29803A and Am29811A Is shown in Figure 16. In 
this example, the least significant microprogram control se- 
quencer is an Am2909 and the two more significant sequencers 
are Am2911s. Figure 15. Initializing the Am2910. 
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Figure 16. A High Performance Microprogram Controller Using the Am2909, Am29811A and Am29803A. 


DETAILED DESCRIPTION OF THE Am2911 AND 
Am29811A IN A COMPUTER CONTROL UNIT 


The detailed connection diagram of a straight-forward computer 
control unit is shown in Figure 17. This design features all of the 
next address control functions described previously and a few 
features have also been added. 


Referring to Figure 17, the instruction register consists of two 
Am25LS377 Eight-Bit Registers with Clock Enable. These reg- 
isters are designated as U1 and U2 and provide ability to selec- 
tively load a 16-bit instruction. This particular design assumes 
that the instruction word consists of an eight-bit op code as well as 
eight bits of other data. Therefore, the op code is decoded using 
three 256-word by 4-bit PROMs. The Am29761 has been 
selected for this function and is shown in Figure 17 as U3, U4 and 
US. 


The basic control function for the microprogram memory is pro- 
vided by the Am2911s. In this design, three Am2911s (U6, U7, 


Note: Figures 17, 18, 20, and 24 are at back of the book. 


and U8) are used so that up to 4K words of microprogram mem- 
ory can be addressed. The microprogram memory can consist of 
PROMs, ROMs, or RAMs, depending on the particular design 
and the point of its development. This particular design shows the 
capability of a 64-bit microword; however, the actual number of 
bits used will vary from design to design. 


The pipeline register associated with the computer control unit 
consists of five integrated circuits designated U16, U17, U18, 
U19 and U20. 


One of the features of the architecture depicted in Figure 17 is the 
event counter shown as U9, U10 and U11. This event counter 
consists of three Am25LS163s connected as a 12-bit counter. 
The counter can be parallel loaded with a 12-bit word from 
pipeline registers U18, U19 and U20. The multiplexer and D-type 
flip-flop (U21 and U22) at the counter overflow output (U9) is 
present to improve system cycle time and will be described in 
detail later. 


40 


MULTIPLEXER SELECT 





R290 Rig Rig R17 SELECT 


0 0 0 0 TEST O 
0 0 0 1 TEST 1 
0 0 1 0 TEST 2 

e @ 

e t 

e @ 


TEST 15 


POLARITY CONTROL 







OUTPUT 


COMPLEMENT 
OF TEST 
TRUE TEST 






NEXT ADDRESS CONTROL 






FUNCTION 
NEXT 


Ris Rig Riz Ri2 






Xx x X x 


MACHINE INSTRUCTION REGISTER 


FUNCTION 





CONTROL VALUE 






FUNCTION 
XXX---XXX 


JUMP ADDRESS 





FUNCTION 


XXX ---XXX JUMP ADDRESS 












INSTRUCTION 







R17 R19 Rg Rg 
6] 5] 4] 311 





















Wp 1C, 1p 1Cy D ¢ B ACLR 





D C B A CLR 
10 15 
























co co! 
U24 is Vio ul U4 a R21 1 
Am25LS153 Am25LS$163 Am25LS163 
1Y cP [TOAD P cP 
oe ee) ; 
3 
Do 
CLR 22 cP 
Am74S174 
Q 
TEST 14 
TEST 13 4 
TEST 12 Ue 
JeeLy Am74S251 
TEST 10 
= nai 
TEST 8 
U43 
Am74S158 
TEST 7 5 
TEST 6 | | fis! 
TEST 5 | fsa) 5 
TesT4 tele ais ; 
leer ae Am748251 7 
TEST 2 | | {2 
rests | |_{2 8 
TEST 0 mz 9 
10 
11 


Q4 Q3 QA, QA, 
CLR U 17 5 
Am74S174 


12 
i 

13 
14 


15 


(Ro—Ryq) 
16 


Figure 17. Computer Control Unit with Am2911. 


At 


Dy5 044013042 D4, 049 Dg Dg 16-BIT DATA BUS D7 Dg Ds Dg D3 Dz Dy Do 


18 FL TLE 


8D 7D 6D 5D 4D 3D 2D 1D 8D 7D 6D 5D 4D 3D 2D 1D 















1 Uy 


1 E U2 
Am25LS377 


Am25LS$377 


1 


cP cP 


80 70 6Q 50 40 30 20 10 8Q 70 6Q 50 40 30 20 10 


OTHER 


A7 Ag A5 Ag Az Az Aq Ag 
Us 
Am29761 


BRi1 
BRi9 
BRg 
BRe 
BR 
BRe 
BR 
BRy 
BR3 
BR5 
BR, 
BRo 


Voc 


CLOCK 


M17 Mig Mg Mg 











60 56 52 48 #44 +%.\40 36 32 2 2 20 416 12 8 4 0 
(12-21) 
(0-11) 
SS ae ee ee 
14 
7 
15 O 
Am2918 Am2918 Am2918 
Og 05° Oy 0G Na. Yo. 7: ¥o Q3 9) 91 A V3 Y2 V1 Yo 30904 09 Ng Yo: Yq YO 
14/11] 5] 2/13] 10] 6] 3 14/11] 5] 2] 13/10] 6] 3 14) 11] 5] 2/13] 10] 6] 3 
Ri7 Rig Ro Rg | BRiq | BRg R7 Rg Rg Rq | BRg | BRy Rg Rp Ry Ro | BR2 | BRo 
BR, BRg BR7 BR, : BR3z BR, 
16 


as ee, vat itl 


42 


This design also features a 16-input condition code multiplexer 
using two Am74S251s, which are designated U12 and U14. 
Condition code polarity control capability has been added to the 
design by using an Am74S158 Two-Input Multiplexer designated 
as U13. The W outputs and Y outputs from U12 and U14 have 
been connected together but only one set of outputs will be 
enabled at a time via the three-state control signal designated as 
Roo and Roo. Since the Y output is inverting and the W output is 
non-inverting, the two-input multiplexer, U13, can be used to 
select the test condition as either inverting or non-inverting. This 
allows the test input on the Am29811A Next Address Control 
Unit, U15, to execute conditional instructions on either the in- 
verted or non-inverted polarity of the test signal. For example, a 
CONDITIONAL BRANCH may be performed on either carry set 
or carry reset. Likewise, the same CONDITIONAL BRANCH 
might be performed on either the sign bit as a logic one or the sign 
bit as a logic zero. Note that the Am2981 1A Next Address Control 
Unit has eight outputs. Four outputs to control the Am2911’s So, 
S,, PUP and FE inputs. Two outputs to control the three-state 
enables of the devices connected to the D inputs, i.e., a map 
enable (MAP E) to select the mapping PROMs and a pipeline 
enable (PL E) to enable the three-state Am2918 outputs which 
make up a 12-bit wide branch address field. The remaining two 
Am29811A outputs are for loading and enabling the Am25LS163 
counters. CNT ENABLE from the Am2981 1A is active-LOW while 
the Am25LS163 counter requires an active-HIGH enable, there- 
fore CNT ENABLE from the Am29811A is passed through one 
section of the Two-Input Multiplexer (U13) for inversion. An alter- 
native counter, the Am25LS169, has enable as active-LOW; 
therefore, this inversion through U13 is not required. 


At this point, a discussion of the typical operation of this computer 
control unit is in order. First, bits 0-11 of the microprogram mem- 
ory output word, are connected to the pipeline register desig- 
nated U18, U19 and U20. The Am2918 has been selected for this 
portion of the pipeline register because of its continuous outputs 
and three-state outputs. The three-state outputs are connected to 
the D inputs of the Am2911 to provide a branch address 
whenever needed. These 12 bits are designated BRg-BR,,. The 
Q outputs of these same Am2918s are designated Ro-R,1 and 
are connected to the parallel load input of the Am25LS163 
Counters. Thus, the counter can be loaded with any value be- 
tween 0 and 4,095. Many designs will take advantage of Ro-R, 
and use it as a general purpose field whenever the counter is not 
being loaded or a jump pipeline is not being performed. Using a 
microprogram memory field for more than one function (branch 
address and counter load value in this example) is called FOR- 
MATTING and will be covered in greater detail later. The other 
two devices in the pipeline register shown on the architecture of 
Figure 17 are U16 and U17. First, U17 receives four bits (12, 13, 
14 and 15) from the microprogram memory to provide four-bit 
instruction field to the Am29811A. This four-bit field, designated 
R4y2-R45, provides the actual next address control instruction for 
the computer control unit. R1. is the polarity control bit for the test 
input and is connected to the select input of the Am74S158 
Two-Input Multiplexer. When Ry,¢ is LOW, the signal at the 
Am29811A test input will be inverted, but when Ry. is HIGH, the 
test input will be non-inverted. 


The Am74S175 has been used as part of the pipeline register 
(U16) because it has both inverting and non-inverting outputs. 
Signals R,7, Rig and Ryg are used to control the One-of-Eight 
Multiplexer (U12 and U14) A, B and C inputs. Pipeline register 
output Rep and Ryo are used to enable either the U12 outputs or 
the U14 outputs such that a one-of-sixteen multiplexer function is 
implemented. In this design, the TEST 0 input of U14 is con- 
nected to ground. This provides a convenient path for converting 


Note Figures 17, 18, 20, and 24 are at back of the book. 


any of the conditional instructions to non-conditional instructions. 
That is, any of the conditional instructions can be executed un- 
conditionally by selecting the TEST 0 input which is connected to 
ground and forcing the polarity control to either the inverting or 
non-inverting condition. This allows the execution of uncondi- 
tional JUMP, unconditional JUMP-TO-SUBROUTINE, and un- 
conditional RETURN-FROM-SUBROUTINE instructions. 


Bit 21 from the microprogram memory utilizes a flip-flop in U17 as 
part of the pipeline register. This output, Ro;, is used as the 
enable input to the instruction register. Needless to say, other 
techniques for encoding this enable function in a formatted field 
could be provided. 


A HIGH PERFORMANCE COMPUTER CONTROL UNIT 
USING THE Am2909 AND Am29803A 


The high performance CCU (Figure 18) is of a similar basic 
design as the previously described CCU. The major differences 
are, referring to Figure 18, the addition of an extended enable 
control (U16), a vector input (U24 and U25), and an Am29803A 
16-way Branch Control Unit (U23). These performance en- 
hancements are more related to function than to actual circuit 
speed. The use of these enhancements by the microprogram 
provides greater flexibility in controlling a machine's environ- 
ment, and can reduce the microinstruction count required to 
perform a particular task, which has the effect of increasing 
overall system throughput. 


In describing this high performance CCU design, those sections 
which remain unchanged from the previous description (Figure 
17), will not be covered again. This includes the mapping 
PROMs, sequencer, Am29811A, counter, condition test inputs 
and associated polarity control, and the pipeline register. The 
areas that will be covered are: extended enable control (U16), 
Vector inputs (U24 and U25), and the Am29803A 16-way Branch 
Control Unit (U23). 


Extended Enable Control 


Extended enable control is accomplished via an Am74S139 dual 
two-to-four line decoder in conjunction with the Am29811A next 
address control unit. In Figure 17, PL E and MAP E of the 
Am29811A were connected directly to the components that they 
are to control (pipeline registers and mapping PROMs, respec- 
tively). Likewise, CNT LOAD and CNT ENABLE are connected 
directly to the counters that they control (with the exception that 
CNT ENABLE requires inversion when_ using Am25LS163 
counters). In Figure 18, PL E, MAP E, CNT LOAD and CNT 
ENABLE go to the inputs of the Am74S139 two-to-four line de- 
coder (U16). When either PL E or MAP E is LOW, then either 2Y, 
or 2Y> of U16 is LOW and either the pipeline branch address 
registers or mapping PROMs are enabled. If both PL E and MAP 
E are HIGH, then output 2Y3 of U16 is LOW enabling the three- 
state outputs of U24 and U25 which are alternate microprogram 
starting address decoders (alternate mapping PROMs), and 
called VECTOR INPUT in this design. Likewise, CNT LOAD and 
CNT ENABLE follow the same rules, enabling the counter to load 
or count via 1Y,; and 1Y2 of U16. 











Vector Input 


The “Vector Input” provides the system designer with a powerful 
next starting address control. For example, one possible use 
might be as an interrupt vector. For instance, use the “Interrupt 
Request” output of an Am2914 Vectored Priority Interrupt Con- 
troller (or group of Am2914s) as an input to one of the conditional 
test inputs of multiplexers (U12 or U14). Then connect the 
Am2914 Vector Out lines to the vector mapping PROMs (Vector 
input U24 and U25). The microprogram then could, at the appro- 


priate time, test for a pending interrupt and if present, jump in 
microprogram memory directly to the routine which handles the 
specific interrupt as requested via the Am2914 Vector Output 
lines. This routine will take the proper steps to preserve the status 
of the interrupt system, and then will service the interrupt. This is 
one of many possible uses for the Vector Input. Other possible 
uses include both hardware and software “TRAP” routines and 
so forth. As can be seen, the design presented here uses the 
Vector Enable line (output 2Y3 or U16) to enable an alternate 
starting address input at the Am2911. This, however, does not 
preclude the use of other devices in place of mapping PROMs as 
the D-input vector source. 


It should be understood that this does not accomplish a “micro- 
interrupt” function in that it is not a random possibility. Instead a 
microprogrammed test is made and an alternate microroutine is 
performed. A true “microprogram interrupt” is one that could 
occur at any microinstruction. The Am2910 does not handle this 
case internally. 


Am29803A 16-Way Branch Control Unit 


The Am29803A provides 16-way branch control when used in 
conjunction with the Am2909 bipolar microprocessor sequencer, 
and is shown as U23 in Figure 18 with its pipeline register U22. 
The Am29803A has four TEST-inputs, four INSTRUCTION- 
inputs, four OR-outputs, and an enable control. The four OR- 
outputs connect directly to the Am2909 OR-inputs (U8 in Figure 
18). The four INSTRUCTION-inputs to the Am29803A provide 
control over the TEST-inputs and OR-outputs, and are provided 
by the microprogram via the pipeline register U22 (Figure 18). 


Basically, the INSTRUCTION-inputs (Ip-!3) provide sixteen in- 
structions (0-F,,) which can select sixteen possible combinations 
of the TEST-inputs and provide a specific output on the OR- 
outputs depending upon the state of the inputs being tested. (The 
subscript 16 refers to basic 16.) All possible combinations of 
instruction-inputs, TEST-inputs and OR-outputs are shown in 
Figure 19. 


Note that instruction zero does not test any inputs (a disable 
instruction). Instructions 1, 2, 4 and 8 test one input and can 
cause a branch to one of two words. Instructions 3, 5,6, 9, 10 and 
12 test two inputs and can jump to one of four words (a 4-word 
page). Instructions 7, 11, 13 and 14 test three inputs and can 
jump on an eight word page. Instruction number 15 tests ail four 
inputs and the result can jump to any word on a sixteen word 
page. 

USING THE Am29803A 


In the architecture of Figure 18, the Am29803A allows 2-way, 
4-way, 8-way or 16-way branching as determined by selectable 
combinations of the TEST-inputs. Referring to Figure 19, the 
ZERO instruction (all instruction bits LOW) inhibits the testing of 
any TEST-inputs, thus providing LOW OR-outputs. Any single 
TEST-input selected (To, T;, To or T3) will result in ORg being 
HIGH or LOW in correspondence with the polarity of the selected 
TEST-input. Selecting any combination of two TEST inputs re- 
sults in the outputs ORg and/or OR, being HIGH or LOW, follow- 
ing a mapped one-to-one relationship, i.e., ORg and OR, will 
follow the TEST-inputs, but no matter which pair of TEST-inputs 
are selected, their HIGH/LOW condition is mapped to the ORo 
and OR, outputs. Likewise, selecting any three TEST inputs, will 
map their HIGH/LOW condition to the ORp, OR; and OR, out- 
puts. Selecting all four TEST-inputs, of course, causes a one-to- 
one relationship to exist between the HIGH/LOW conditions of 
the TEST-inputs and the corresponding OR-outputs. Refer to 
Figure 19 to verify the relationships between INSTRUCTION- 
inputs, TEST-input, and OR-output. It is very important that the 
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mapping relationship between these signals be completely un- 
derstood. When using the Am29803A TEST-OR capability as 
shown in Figure 18, the microprogrammer must position the 
applicable microcode within microprogram memory so that the 
low-order address bits are available for ORing. Sequencer in- 
structions using the Am2909/2911 D-inputs (JRP, JSRP, JP and 
CJS in particular) are ideally suited for the Am29803A TEST-OR 
capability. The jump-to-location, available via pipeline BRo-BR , 
or the Am2909/291 1 register, can contain the address of a branch 
table. A branch table is merely a sequential series of uncondi- 
tional jump instructions. The particular jump instruction executed 
is determined by the low-order address bits; that is, the first jump 
instruction in a branch table must start at a location in micropro- 
gram memory whose low-order address bit (or bits) is zero. Ifa 
single Am29803A TEST-input is selected (2-way branching) then 
only the least significant bit in the beginning branch table address 
needs to be zero. Two Am29803A TEST-inputs selected (4-way 
branching) requires that the branch table start on an address with 
the low-order two bits equal to zero; 8-way branching requires 
three low-order zero bits, and 16-way branching requires four 
low-order zero address bits. Understanding this branch control 
concept is really quite simple. The branch table is located in 
microprogram memory beginning at a location whose address 
has sufficient low-order zero bits to accommodate the number of 
selected Am29803A TEST-inputs. If, for instance, three TEST- 
inputs were selected, the first jump instruction in the branch table 
must be at an address whose low-order three bits are zero, such 
as address OF8,,. The second jump instruction in the branch 
table would begin in microprogram memory address OF9;.. The 
third jump at location OFA;,, the fourth at OFBy., etc. Through all 
eight locations (OF 8,,_-OFF,,). Assume the following pipeline in- 
struction (referring to Figure 18): (1) U22 selects three 
Am29803A TEST-inputs, (2) U18 instructs the Am29811A Next 
Address Controller to select the Am2909/2911 D-inputs, (3) U16 
enables the pipeline branch address as the D source, and (4) 
U19, U20 and U21 supplies the address OF8,, as the branch 
address. The Am29803A TEST-inputs will be ORed into the 
low-order three bit positions, thus providing a jump entry into the 
branch table indexed by the value of the OR bits. Each instruction 
in the branch table is usually a jump instruction, which allows the 
selection of a particular microcode routine determined by the 
value presented at the Am29803A TEST-inputs. These jump 
instructions are the first instruction of the particular sequence. 
There are, of course, many other ways to use the Am29803A 
16-way Branch Control Unit. 


The microprogram memory address supplied via an Am2909 
sequencer can be modified by the Am29803A 16-way Branch 
Control Unit. Remember, however, that the microcode as- 
sociated with this address modification relies on certain address 
bits being zero, therefore this microcode is not arbitrarily relo- 
catable. The above discussion describes using the D-input and 
branching to provide low-order zeroes to use the OR inputs. 
Through proper design, the Register, PC Counter, or File can be 
used equally well. 


THE COMPLETE COMPUTER CONTROL UNIT 
USING THE Am2910 


A detailed connection diagram for a straightforward computer 
control unit using the Am2910 is shown in Figure 20. This design 
utilizes the Am25LS377 as U1 and U2 to implement a 16-bit 
instruction register. The op code outputs from the instruction 
register drive three Am29761 PROMs to perform the op code 
decoding function. These are shown in the diagram of Figure 20 
as U3, U4 and U5. The Am2910 sequencer (U6) is used to 
perform the basic microprogram sequencing function. 
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Figure 19. Function Table. 


A 16 input condition code multiplexer function is provided by 
using two Am2922s as U7 and U8. These devices allow one of 
sixteen inputs to be tested and the polarity of the test can also be 
determined. The pipeline register consists of U9, U10, U11, U12 
and U13. These devices are edge triggered D type registers and 
have been selected to provide unique functions as required de- 
pending on their bit positions in the pipeline register. An 
Am74S175 was selected for U9 because both a true and com- 
plement output were desired to provide control to the condition 
code multiplexer three state enables. An Am74S174 register was 
selected as U10 because it provides a clear input for initializing 
the Am2910 microprogram sequencer. Three Am2918s were 
selected for U11, U12 and U13 because they have a three state 
output that can be used to provide the branch address field to the 
D inputs of the Am2910 and they also have a set of outputs that 
can be used to provide other control signals via this field when it 
does not contain a branch address. No specific devices are 
shown for the microprogram memory as the user should select 
the desired width and depth depending on his design. 


ANOTHER DESIGN EXAMPLE 


The Am2909, Am2910, Am2911, Am29811A and Am29803A 
have been designed to operate in the microprogram sequencing 
section of any digital state machine. Typically, the examples 
shown are for performing the computer control unit function of a 
typical minicomputer class machine. The design engineer should 
not limit his thinking for the use of these devices simply to that of 
microprogram sequencing in a computer control unit. These de- 
vices can be successfully used in other areas of designing such 
as memory control, DMA control, interrupt control and special 
purpose microprogrammed machine architectures. In order to 
provide an example of a design using these devices in something 
other than a typical computer control unit, a microprogrammed 
CRT controller is described in the following. 


In order to provide some basis for the design of a CRT controller, 
the requirements of this controller must be spelled out. These are 
given as follows: 


A) Character size: 5 x 7 dot matrix. The character field will be 7 

dots by 10 horizontal lines thereby providing ample space for 

the 5 x 7 character and the intervening space between 
characters and lines of characters. 

80 characters per line. A standard 80 character per line dis- 

play will be utilized and there will be 18 character periods 

allowed for horizontal retrace time. 

24 lines of characters per frame. This provides a total of 240 

visible lines per frame (24 lines of characters by 10 horizontal 

lines per character). There are a total of 24 lines provided for 
vertical retrace bringing the total number of lines per frame to 

264. 

Refresh rate 60 frames per second. Therefore, the horizon- 

tal line rate will be 264 x 60 = 15,840Hz. As there are a total of 

80 + 18 = 98 character periods in a line, the character rate 

will be 98 x 15.84 = 1,552.32KHz, and the dot rate will be 7 x 

1.5288 = 10.86624MHz. (Note No interlace is used.) 

E) It is assumed that there is a 2K word deep x 8-bit wide 
character RAM available to the host computer in which it can 
write the ASCII equivalent of the characters to be displayed. If 
scrolling is to be used, the host computer must also write the 
first visible character's address divided by 169 into the 
Am25LS374 “First Address Register”. 

F) This CRT controller must generate an 11-bit character ad- 
dress that ts used by the 2K word deep character RAM. It must 
also generate the required video enable signals and the hori- 
zontal and vertical blanking signals. 
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Principle of Operation 


A detailed block diagram of the CRT controller is shown in Figure 
21. The block diagram shows an interface to an SBC-80/10 data 
bus, address bus and control bus. The outputs of the CRT control- 
ler are connected to a CRT monitor on the block diagram. Other- 
wise the block diagram shows a straightforward use of the 
Am2910 and three Am2911s to implement the CRT control func- 
tion using microprogrammed techniques. The SBC-80/10 was 
selected for this example since tt is well Known. 


A logic diagram of the CRT controller is shown in Figure 22. Three 
Am29775 512-word x 8-bit registered PROMs are used to contain 
the 23-bit wide microprogram. While only a minimum number of 
words are used in the design as shown, many additional words 
can be used to add various options (as described later). The 
address for these Am29775 registered PROMs is provided by an 
Am2910 microprogram sequencer. Three Am2911 sequencers 
are used to generate the character address for the character 
RAM. The least significant Am2911 sequencer is connected as a 
divide by 16 counter. This RAM address is compared with the 
desired last character address (80 x 24 = 1920) value using an 
Am25LS2521 8-bit equal to detector. When the last address is 
detected, it can be sensed at the condition code multiplexer 
(Am25LS153) that is used to select the condition code for the 
Am2910 sequencer. 


The data derived from the 2K word character RAM is decoded by 
a character generator (6061) in this design and the character 
output is parallel loaded into an Am25LS23 shift register. This 
shift register is used to provide the video signal from its Qp output 
to eventually drive the display via an Am74S240 buffer. The 
diagram of Figure 22 depicts an oscillator input source to supply 
the dot frequency. In this design, a 10.86624MHz oscillator 
should be connected to this oscillator input point. This oscillator 
input signal is used to clock the shift register containing the 
individual dot bits (dot-on or dot-off) and also drives an 
Am25LS169 counter which divides this frequency by 7 to gener- 
ate the character rate clock. This character rate clock is used 
throughout the controller to provide a timing signal for the state 
machine design. 


An Am25LS168 decade counter is used to generate the line 
inputs for the character generator and to count 10 horizontal lines 
per character space. This counter is clocked by the horizontal 
blanking signal (HB) and its RCO output is used as one of the 
condition code multiplexer inputs. The RCO output can be tested 
to determine when 10 counts have been executed by the counter 
and it is also used to enable the last address comparator during 
the 10th horizontal line time. 





When the host computer accesses the character RAM, the 
HOST-ACCESS line is pulled LOW. This removes the Am2911 
outputs from the character RAM address bus. When this access 
occurs, improper data may be present at the shift register inputs. 
Thus, the character generator PROM output ts disabled by the 
HOST-ACCESS signal during this time. 


When power ts applied to this CRT controller or whenever it is 
reset, the RESET line is driven LOW. This signal is inverted 
through an Am25LS240 and then disables a part of the pipeline 
register outputs as well as enabling one half of an Am25LS241. 
This Am25LS241 inserts LOWs onto the instruction (I) inputs of 
the Am2910 sequencer. Then, the next character rate clock will 
force the microprogram address outputs to zero and the micro- 
program for the CRT controller as shown in Figure 23 will be 
executed starting at address zero. 
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Figure 20. Computer Control Unit with Am2910. 
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Figure 21. CRT Controller Block Diagram. 


The Microprogram for the CRT Controller 


Table 6 shows a complete description of the microprogrammed 
CRT controller microcode. Execution of these microinstructions 
is controlled by the Am2910 sequencer. 


As can be seen in Table 6, several techniques were used in this 
short microprogram to provide the different counting require- 
ments of this CRT controller. Although only one format (80 
characters per line, 24 lines per frame) was shown here, the 
designer can easily configure his own format by simply changing 
some constants in the microprogram. As an exercise, the reader 
is encouraged to find a means to program the CRT controller for 
different formats. The host computer software could configure the 
controller format by using an additional register similar to the 
“First Address Register’. This will be discussed in an appendix at 
the end of this chapter. 


A complete wiring diagram for the microprogrammed CRT con- 
troller is shown in Figure 24. This can be used directly with the 
interface shown in Appendix A such that the CRT controller can 


Note’ Figure 24 is at back of the book. 


be connected directly to an Am9080A based microprocessor 
system. Appendix A also depicts the use of a 2K word x 8 bit 
character RAM as described previously. 


CRT Controller Timing Considerations 


As was discussed earlier, the character clock frequency for the 
CRT controller is 1,552.32KHz. Thus, it is desirable to calculate 
the longest path of the design to ensure that none exceed this 
clock period of 644.1ns. The timing diagrams of Figure 25 depict 
a number of different paths with the associated propagation delay 
calculations. 


When all of the timing diagrams of Figure 25 are examined, it will 
be found that only three show propagation delay times of over 
200ns typical. Of these, the worst case is 318ns as shown in 
Figure 25(i). Since the requirement of the design is to insure that 
none exceed 644.1ns, we have more than a 2 to 1 margin in the 
design based on the typicals. Thus, we can see that the design 
will operate properly even over the full military temperature range 
and power supply variations based on this analysis. 
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Figure 22. CRT Controller. 
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Comments 


X ,Load first address from Register to 2911's file 
;Load 2910's counter with member of rows/frame — 1 
X Address supplied by 2911's file 


‘One row 5 x 16 = 80 characters 


$ 


TENTH | | if tenth (last) line of a row jump to “TENTH” subroutine 
LASTA | ,If last character jump to “LASTA” subroutine 
$ ,Wait, until horizontal invisible counts done 
MAIN ,Then do the Main routine again 


GOBACK} ,Push next addr on 2911's file jump to “GOBACK’” if not 
End of Frame 
X ,Load 2911’s file from First Address Register 
14649 ,Load 2910’s counter with number of invisible characters 
during Vert retrace divided by 16, minus 1 
X ;Push next PC to 2910's file for double 
$ ,Wait for LS2911 to count 16 
x ,Decrement 2910's counter and jump one line back if = 0 
2310 ;Load 2910's counter again with number of rows/frame — 1 
x ‘Return from subroutine 


xX -Return 


xX ‘Load zero to 2911’s file and return. 





Figure 23. Microprogram for the CRT Controller. 


TABLE 6. DESCRIPTION OF THE MICROPROGRAM FOR THE CRT CONTROLLER. 


Micro- 
program Low 
Address Order Am2911 


0 Since ZERO ts low, its output 
will be LOW. The C,, input 
(from the Pipeline Register) 
is LOW so that the micro- 
program incrementer will not 
increment. 


ZERO and C, are still LOW, 
so no change in this device. 


2 Maintaining ZERO LOW 


“MAIN” | assures the proper starting 
address. C,, is HIGH; there- 
fore, the internal PC will 


be incremented. 


Note Figure 24 ts at back of the book 


High 
Order Am2911s 


Both S; and Sg are HIGH so 
that the D inputs will be routed 
to the Y outputs. These inputs 
will come from the First Address 
Register (the Am2910 VECT is 
LOW). Cy is LOW (see left 
column); therefore the micro- 
program counter will not incre- 
ment. FE is LOW (and PUP is 
always HIGH) causing the pre- 
sent output to be pushed on 
the stack. The character ad- 
dress ts already the “First 
Character Address”. 


S; and So are LOW; thus, the 
Y outputs will be the current 
PC, (the same as the Y out- 
puts were in the previous step). 
Cr Is still LOW, therefore no 
change will occur in the PC 


With S; = HIGH, Sp = LOW 
and FE = HIGH, the Am2911 
will refer to its internal file 
(the starting address of this 
particular character-row) 
without popping 


Am2910 


The CJV instruction is Selected. 
Therefore, VECT output will be 
LOW, enabling the “First Ad- 
dress Register onto the internal 
8-bit bus. CCEN is LOW; the 
MUX is selecting a constant 
HIGH, and the sequencer will 
address the next consecutive 


microprogram address (word 1). 


LOCT ts selected and the num- 
ber of character-rows per frame 
minus 1 (2349) is loaded into 
the Am2910 register/counter. 
The sequencer addresses the 
next microinstruction. 


The Am2910 will generate the 
next microprogram address. 


Comments 


This instruction pushes the “First 
Character Address” more signif- 
icant bits onto the Am2911’s file, 
and continues to the next micro- 
instruction. 


This ts the starting location for 
the main loop. 
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TABLE 6. DESCRIPTION OF THE MICROPROGRAM FOR THE CRT CONTROLLER (Cont.). 


Micro- 
program Low High 
Address Order Am2911 Order Am2911s Am2910 Comments 


3 


4 
through 
7 


Cc 
“TENTH” 






















This Am2911 now counts up 
using its PC incrementer At 
the final count (moving from 
F46 to 0) tts C,44 output will 
be HIGH 


Initially these two Am2911s 
will not change their Y outputs 
since their C,, input is LOW. 
However, when the C,, input 
goes HIGH, the internal PC 
will increment 


With the MUX selecting the 
Cy+4 output from the least 
significant Am2911 slice, the 
CC input to the Am2910 se- 
quencer will be LOW until the 
Am2911 counts 16. CC = 
LOW will cause the next mi- 
croprogram address to be the 
pipeline register contents; this 
is also the current micropro- 
gram address (word 3). When 
Cny+4 goes HIGH, CC will go 
HIGH and together with CCEN 
= LOW, will force the Am2910 
to address the next consecu- 
tive microprogram address (4) 


This microstep will be executed 
16 times (Note that 80 = 5x 16) 































The microprogram itself is used 
as a counter in this applica- 

tion since the count Is only 5, 
the microprogram ts relatively 
short versus the memory’s depth 
and this is a convenient means 
to economize on chip count 


Same as 3. Same as 3. Same as 3, except that at each 
address, the current micro- 


program address !s written 






















The MUX selects the 
Am25LS168 ten-line-counters 
RCO as the condition code in- 
put to the Am2910 (CC) if the 
line count ts less than 10, CC 
will be HIGH and the next mi- 
croinstruction will be addressed 
If the tenth line of a character 
row is executed, CC will be 
LOW and a JUMP-TO-SUB- 
ROUTINE to an address, sup- 
plied by the pipeline register 
(“TENTH”) will be executed 


The MUX now selects the Last 
Address Comparator output for 
CC If the current more signif- 
icant bits of the character- 
address coincide with the last 
address + 1 (1920;9/16) a 
subroutine call will be per- 
formed to “LASTA” Other- 
wise, the microprogram will 
continue consecutively 


We are now at the end of a TV 
line Therefore, the Horizontal 
Blanking Signal (HB) is HIGH 
The least significant Am2911 
slice now counts the invisible 
characters during the horizon- 
tal retrace. 


Continues to count (note that it 
enters this line with an output 
of zero) 


Since C,, is LOW (see left 
column) no change occurs in 
these devices. Note that the 
Y outputs contain the more 
significant bits of the address 
of the first character of the 
next character row 































Note that 80 characters/row 

and 24 rows/frame requires a 
1920,9 word memory When 

the last memory location 

(192049) is read out, the scan will 
begin at 0 


Continues to count through 
the internal PC incrementer 


No change 
















Waiting for the least significant 
Am2911 to count to 15 This 
microstep will be executed 

as many times as necessary 
to accomplish this 


Performing a JUMP to the 
beginning of the main-loop 
(address 2). 


Continues to count At count Same as at address 3. 


15, Cnh+4 goes HIGH. 


No change until C,, goes 
HIGH, then count 
























Unconditionally (CCEN = 
HIGH) steers the micropro- 
gram to the address supplied 
by the pipeline register 
(“MAIN” = 2) 


If internal counter ts equal to 
zero, it means that 24 character 
rows were already displayed 
and we are at the bottom of the 
CRT display A vertical retrace 
period is needed and the mi- 
croprogram will continue 
sequentially. If the counter ts 
not yet zero, we do not need 

to execute the vertical retrace 
routine and the next address 
will be supplied by the pipe- 
register (“GOBACK” = 
144.) while the internal 
counter is decremented. 


It doesn’t matter what this 
device does at this microstep 
because at the next micro- 
Step it will recerve LOW on 
its ZERO input 


Continues to count 


No change 




























The decision whether the bottom 
of the CRT (End of Frame) ts 

reached or not is made internally 
in the Am2910, using its counter 


No change 
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TABLE 6. DESCRIPTION OF THE MICROPROGRAM FOR THE CRT CONTROLLER (Cont.). 






























Micro- 
program Low High 
Address Order Am2911 Order Am2911s Am2910 Comments 
D ZERO = LOW, therefore, out- Same as at address 0 Same as at address 0 As we are at the End of Frame, 
put Y = O This ts necessary the “First-Address-Register” 
to assure that Ch 44 1s LOW contents (enabled by the 
Am2910’s VECT output) Is 
pushed onto the Am2911’'s file 
Note that the Vertical Blanking 
Signal (VB) goes HIGH 
E Same as at address B The internal counter is loaded (14649 + 1) x 1649 = 235249 





with 14619, supplied by the 
pipeline register The next 
consecutive microstep is 
addressed 


equals the number of character- 
periods during vertical retrace. 
Loading 2352, directly into the 
Am2910’s counter would require 
7 bits. Usingthis scheme we 
reduce the microprogram width. 
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104 Counts When final count is No change with C, = LOW, 

reached, Cyh44 = HIGH increments with C,, = HIGH 
This has no practical affect as 
the HB signal is HIGH, and at 
the beginning of the next vis- 
ible line, the correct address 
will be fetched from the file 
(address 2) 












With CCEN = LOW and CC = 
HIGH (supplied from a con- 
stant HIGH by the MUX), the 
next address (104g) will be 
pushed onto the Am2910 file, 
the counter will not be af- 
fected and the next consecutive 
microstep will be addressed 


The MUX supplies the C,+4 
output of the less significant 
Am2911 slice to the Am2910 
CC input While this signal is 
low, the Am2910 will select 

the pipeline register as the 
source of the next microin- 
struction address The current 
address (10},;) being written 
there, this instruction will be exe- 
cuted until CC goes HIGH Then 
the next consecutive instruc- 
tion will be selected through 

the Am2910 internal PC 


11y Counts No change If the final count has been This ts the external loop of the 
reached, the next micro- double-nested loop system, 
instruction will be addressed _| which counts the vertical retrace 
and the internal stack will be interval. By adding a single mi- 
popped (adjusted) Otherwise, croinstruction the chip count 
the next microinstruction ad- was reduced 
dress will be the one residing 
on the top of the stack (which 
IS 1046). 


12y Counts No change Same as at address 1 Reinitializes the Am2910 internal 
counter with the number of 
character rows per frame. 
13, Counts No change Unconditional return from End of “TENTH” subroutine at 
subroutine (CCEN = HIGH) End of Frame (with vertical 
retrace). 
144, Counts No change Unconditional return from End of “TENTH” subroutine 
“GOBACK”’ subroutine without vertical retrace 
15y Counts Pushes zero into file Unconditional return from A one-line subroutine to reini- 
“LASTA” subroutine tialize character address to zero 


This is a preparatory step for the 
2 step “Vertical Retrace” double- 
nested loop. 






Again, this ts a possible way to 
dwell on a certain microstep 
waiting a condition to change 

its status (like address 3 through 
7) This is the internal loop of 

a double-nested loop system 














































































a) 

DEVICE NO. DEVICE PATH PATH 1 | PATH 2 

29775 CP to D 15 

2911 (A) Cato Cats ~ 

2911 (A) ZERO to Cn44 2 eee 
2911 (B) Cn to Cn+4 Am25LS153 cc VECT OE 

2911 (C) Cn (ts) Am2910 

2911 (B,C) | FE (ts) 

2911 (B) So, $1 to Cy+4 


2911 (C) Cp (ts) 


Am25LS168 rior 
CNT 


Q 


Am25LS23 
REG 





OE 
Am25LS2521 
A 
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Am91t4 ae eds : 
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PATH1 ——~ ——~ 
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29775 CP to D 15 
2911 (A) ZERO to Y 19 - - : 
2911 (B,C) | ZEROtY Am251.8374 
2911 (B, C) So, $1 to Y cc VECT OE 


9114 Am2910 


6061 
25LS23 


AtoD 
A to Out 
D to CP (ts) 











TOTAL-ns 


CLOCK 
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Am2911 
(yf | 


PATH 1 min wee me 
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MPR-492 


Figure 25. 
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c) 


DEVICE PATH PATH 2 | PATH 3 
EsDE Wie Gc etace Ene | GATICT, | eATIUE | ATT: 


CP to D 15 
Ito Y 
CCEN to Y 
CP to Y 
A (ts) 











D Y 


Am25LS374 























VECT 







Am25LS153 






Am2910 






Am25LS168 
CNT 







Am25LS23 
REG 










OE 
Am25LS2521 








A ZERO OD 
SoS 
FE 


Am2911 
(C) 
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FE 
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PATH 2 
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| DEVICE NO. | DEVICE PATH | PATH1 
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25LS153 A, Bto Y 
2910 CC to Y 
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ovae | |_| 
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29775 CP to D 
2910 Ito PL, VECT 
29775 E,toD 
25LS374 OE to Y 
2910 PC (ts) 
2911 D (ts) 
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9) 


| DEVICE NO. DEVICE PATH PATH 1 | PATH2 | PATH3 












































29775 CP to D 

2911 So, $1 to Y 

2911 CP to Y (SqSq = HL) 

25LS168 CP to RCO Y 
25LS$2521 A to Eg eneien 
25L$2521 E; to Eg 





25LS153 
2910 
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D to Y 
CC to Y 


A (ts) 






Am25LS168 
CNT 
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Am25LS23 
REG 
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Am9114 
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PATH 2 
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h) 















2911 CP to Y (SySo = HL) 
2911 CP to Cy+4 (S4Sq = HL) 
2011 Cp (ts) 

9114 Ato D 

6061 A to OUT 

25LS23 D (ts) 
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Am25LS374 
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}| DEVICE NO. | DEVICE PATH | PATH1 | PATH2 | PATH3 
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SUMMARY 


The Am2910 provides a powerful solution to the microprogram 
memory sequence control problem. The Am2910 is a fixed in- 
struction set, 12-bit wide microprogram sequencer. In addition, 
the Am2909, Am2911, Am29811A and Am29803A provide 
another solution to the microprogram sequencing problem. 
These devices are bit slice oriented and provide more potential 
flexibility to the microprogram sequencing solution. All of these 
devices are particularly well sutted for the high performance 
computer control unit and structured state machine designs using 
overlap fetch of the next microinstruction — also referred to as 
instruction-data-based microprogram architecture. 


These Am2900 family microprogram control devices offer the 
highest performance LSI solution to the problem of microprogram 
control. They provide a number of conditional-branch source 
addresses as well as conditional jump-to-subroutine and 
conditional-return instructions. In addition, several techniques for 
timed and untimed looping are provided such that loops from one 
to several microinstructions can be executed. All of the devices 
described in this chapter are competitively priced and currently 
available. In addition, all of these devices are available with 
specifications guaranteed over the full commercial temperature 
range and power supply tolerance as well as the full military 
temperature range and power supply tolerance. All of these de- 
vices undergo 100% reliability assurance testing in compliance 
with MIL-STD-883. 
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APPENDIX A 


Figure A1 shows the logic diagram of an interface circuit used to 
connect the microprogrammed CRT controller to any AmM9080A 
type processor. Sixteen address-lines, eight data lines, a 
memory-read, a memory write and an I/O write signal are as- 
sumed to be used in an active LOW polarity. 


An Am25LS2521 8-bit comparator is used to decode the addres- 
ses of the 2K by 8 character memory. This memory can be placed 
anywhere in the memory space in increments of 2K by using 5 
DIP-switches. The comparator is enabled by the presence of 
either the MMR or the MMW signal. The output of this comparator 
is the HOST ACCESS signal. 


The HOST ACCESS signal enables the two Am25LS240 buffers 
which connect the processor address bus to the character mem- 








Note Figure A2 ts at back of the book 


ory address bus. It also enables one half of an AM25LS241 buffer 
transferring the MMR or MMR active LOW signal to the proper 
data buffer enable (Am25LS240’s) and to the WE pins of the four 
Am9114 memories in case of a memory write operation. The CS 
of two of these memories are driven by Ayo while the CS of the 
other two memories are driven by Ajo, thus forming a 2K by 8 
memory space. 


An Am25LS2521 8-bit comparator is enabled by the I/OW control 
line. Ifn matches the settings of the DIP switches at the B inputs of 
the comparator, an OUT n instruction will write the data into the 
Am25LS374 “First Address Register’. 


Figure A2 shows the complete wiring diagram of this interface 
circuit. 
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Figure A1. CRT Controller. 
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Figure A2. CRT Controller. 
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APPENDIX B 


General 


A software emulation of the CRT controller was written in 
BASIC-E and run on the System 29 support processor. Figure B1 
is a printout of this program. 


Notations 


For reference purposes, each clock pulse (CP) in the program is 
numbered. The clocks are characier-rate clocks. A subscript “10” 
signifies that this variable belongs to the Am2910 (e.g. R10 = the 
contents of the Am2910 Register Counter) and similarly a sub- 
script 11 signifies the Am2911 dependent variables (e.g. Y11 — 
the Y outputs of the two more significant Am2911s). 


Usually the normal function names were used though for the 
active LOW functions the bar was deleted for simplicity. A 0 
signifies always a LOW and 1 signifies HIGH. Other abbrevia- 
tions used in the program: 


MA = Microprogram Address (Y output of the Am2910) 
CA = Character Address 
PC = Program Counter (internal) 
R = Register (internal) 
F = File (internal) 
SP = Stack Pointer (internal) 
TENC = The Am25LS168 decade counter 
L4B = The 4 least significant bits of CA (the Y outputs of 
the less significant Am2911 
CN = Carry-in into the less significant Am2911 
CN4 = Carry-out from the less significant Am2911 
CN4 = Carry-in to the next significant Am2911 
110 = The Am2910 instruction 
HB = Horizontal Blanking signal (active HIGH) 
VB = Vertical Blanking signal (active HIGH) 
CPM = Maximum Clock Pulse (at which the program 
stops) 


Description 


The different groups and subroutines of the emulation program 
are as follows: (See Figure B1). 


<1000 series: The microcode. Subroutine 50 is the 
Am25LS168 decade counter clocking routine. 
TENTH is the RCO output of this device. 

1000 series: This is essentially the Am2910 emulation. 

Note the definition of the two functions 

FNFAIL and FNPASS at the beginning of the 

program, compare to the Am2910 instruction 

definitions in its data sheet. 

The Am25LS153 multiplexer emulation. 

The less significant Am2911 emulation. Note 

that the only input to this device is ZEROL. 

CN and the internal PC (called L4B) are con- 

trolled in the CLOCK Subroutine (4000 series). 

The two more significant Am2911’s emulation, 

So and S; are treated as a single number 

(ranging from 0 through 3) and denoted by 

$11. 

The Clocking routine. 

The main emulation routine. It includes the 

Am25LS2521 comparator routine and checks 

the Clock Pulse against CPM to determine 

end of run. 

Emulation parameter setup (initialization). 

The starting and ending CP numbers, MA, 

TENC, R10 and VECTOR (The “First Address 

Register’) can be set. 

Sets up the print-out parameters 

Printout subroutine 

Sets the program mode: RUN, PRINT or QUIT 

(return to CP/M) 


2000 series: 
2500 series’ 


3000 series: 


4000 series: 
5000 series: 


5500 series: 


6000 series: 
7000 series: 
9000 series: 


The emulation was exercised to evaluate fifteen different perfor- 
mance aspects of the CRT Controller. The results indicated that 
in all cases, the design operated as desired. 
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REM 
REV=12 
PRINT REV 
7000 REM HEADER 
PRINT 
PRINT 
FROIN T 269626 99 HH HHH HEHEHE KERR HER HRRHKE KKK EREKRRERHER" 
PRINT 
PRINT , 
PRINT ° A MICROPROGRAMMED CRT CONTROLLER EMULATION" 
PRINT 
PRINT 
FROIN T "369696 16 96 26 2696 HE HEHE HEH HEE HE HHH HHH HEH EHH HHH EKER ERE" 
PRINT 
PRINT 
PRINT * BY MOSHE M. SHAVIT" 
PRINT ° ANVANCED MICRO SEVICES" 
PRINT ° FEBRUARY 27, 1978° 
FRINT 
PRINT 
REM 
DIM F10¢46) 
DEF FNFAIL=CCEN=0 ANTI CC=1 
DEF FNFASS=CCEN=1 OR CC=0O 
REM 
REM 
REM GOTO 46000 REM PROGRAM PARAMETERS (REMOVED REV 6) 
REM 
REM Z--REV 6 
KEM 
7100 PRINT 
PRINT 
PRINT 
INFUT "R-UN, P-RINT OR Q-UIT ";MODES 
IF LENCMODES)=0 THEN GOTO 9100 
MODE=ASC (MODES )-79 
IF MODE<1 OK MODE > 3 N\ 
THEN FRINT MODES; *" IS INVALID’ s\ 
GOTO 92100 
ON MODE GOTO 9110,9120,9130 
REM 
9120 RETURN 
REM 
9130 REM RUN 
FRINT 
INFUT ®PUT RESULTS ON FILE (0 IF DIRECT PRINTOUTO= “SWFILES 
PRINT "CP= "sCFs"*MA= “sMAS"VECTOR= "sVECTORs\ 
"CPM= "sCPM3*°ROW= "3 24-R10 
INFUT "INITIALIZE (CY OR Ns CF,MA=0 IF N>*3SS 
TIF S$="Y" \ 
THEN GOSUB S500 \ REM INIT. 
ELSE CF=0 =: MA=0 
IF WFILES="0" \ 
THEN GOTO 6010 \ REM DIRECT FPRINTOUT 
er ELSE FILE WFILES =: GOTO S000 KEM MAIN 
9110 REM PRINT 
FRINT 
INFUT "GET RESULTS FROM FILE=";RFILES 
FILE RFILES 

REM 

6000 REM PRINT PARAMETERS 


PRINT 


6010 PRINT "OUTPUT FORMATS: " 
Figure B1. 
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REM 
6020 


REM 
6030 


6032 


6034 


6036 


KEM 
4090 


REM 
6900 


PRINT ° A=CF AND CA ONLY" 
PRINT * B=CP,CA,HB,VB,MA" 
PRINT * C=CP,CA,MA,TENC,R10° 
PRINT * D=ALL* 

PRINT 


INFUT “FORMAT=" sFORMATS 
IF LENCFORMATS)=0 THEN GOTO 6010 
TF ASCCFORMATS)<65 OR ASCCFORMATS)>68 \ 
THEN PRINT FORMATSs;° [S ILLEGAL" =:N 
GOTO 6010 
PRINT 


REM 

IF WFILES NE "0" \ 

THEN CONTROLS="A*° =\ 
GOTO 6030 

PRINT “CLOCK CONTROL” 

PRINT " A=CONTINOUS® 

PRINT * B=STEF® 

INPUT "CONTROL=° sCONTROLS 

IF LENCCONTROLS)=0 THEN GOTO 6020 

TF ASCCCONTROLS)<65 OR ASC(CCONTROLS)>66 \ 
THEN PRINT CONTROLS;" IS ILLEGAL" =\ 

GOTO 6020 
PRINT 


PRINT "OUTPUT CONTROL" 


PRINT " A=AT EACH CF® 

PRINT ° B=AT EVERY N-TH CP" 

PRINT " C=MANUAL CONTROL * 

PRINT * N=STARTING AT CRS AT EVERY CF® 


PRINT ° E=STARTING AT CPS AT EVERY N-TH CF" 
INFUT "OQUTPUT=" ;QUTPUTS 
IF LENCQUTPUT$)=0 THEN GOTO 6030 
IF ASCCOUTPUTS)<65 OR ASCCOUTPUTS) 269 \ 
THEN FRINT OUTFUTS;" IS ILLEGAL" :\ 
GOTO 6030 
O.CTL=ASC (OUTPUTS )-64 
ON O.CTL GOTO 6090, 6032, 6090, 6034, 6036 
INPUT "N="sN 
M=0 
GOTO 6090 
INFUT "CPS= "sCFS 
GOTO 6090 
INFUT "CFS= *sCFS 
INPUT "N= "sN 
M=0 
GOTO 6090 


FORMAT = ASC(FORMATS)-64 
ON FORMAT GOSUB 6190, 4300,6200,46100 


IF WFEILE$="0" THEN GOTO 5000 REM MAIN 

PRINT 

IF END #1 THEN 4910 

FOR I=1 TO 2 STEP 0 REM OQ UNTIL END OF FILE 

REAL #1; CP,R10,F1,SF10,FPC10,CA,MUX,CC,CCEN,MA, TENC,\ 
CN4,F11,HE,VB 

F1O(SF10)=Fi 

GOSUER 7000 REM PRINT 

GOSUE 5200 REM ESCAPE (REV 7) 


IF S=155 THEN FRINTsSPRINT “ABORTED AT "sCP =: GOTO 6910 
NEXT I 


Figure B1 (Cont.) 


REM 
6910 


REM 
46100 


6190 
REM 
6200 


REM 
6300 


REM 
REM 
7000 


REM 
7002 


REM 
7300 


REM 
7400 


REM 
REM 
3000 


CLOSE 1 
OUT 100,12 REM PRINTER PAGE EJECT (REV 7) 
GOTO 7100 


PRINT 

PRINT "CP", *RI0",*F10", "SF10", "FC10" 
PRINT "CA", °*MUX*,*CC",*CCEN", "MA" 
PRINT "TENC","CN4", "Fil", *HR", "VB" 
FRINT 

RETURN 


PRINT 
FRINT "CLOCK", "CHAR.ADDR', "2910 REG.", "LINE CNTR.", "NEXT MA" 
RETURN 


PRINT 
FRINT "CLOCK", °"CHAR.ADTIK", "H.-BLANKING", "V.RLANKING", "NEXT MA" 
RETURN 


REM PRINT SUBROUTINE 
ON O.CTL GOTO 7010,7005, 7002, 7003, 7004 


INPUT "OUTPUT CY OR N)"35% 
IF S$="Y¥" \ 
THEN GOTO 7019 \ 
ELSE RETURN 


IF CP<CFS THEN RETURN ELSE GOTO 7010 
IF CPACPS THEN RETURN ELSE GOTO 7005 


M=M+1 
IF M=N THEN M=0 =: GOTO 7010 ELSE RETURN 


ON FORMAT GOTO 7100,7200, 7300, 7400 


PRINT "CF= *5CP,"CA= "5CA 
RETURN 


IF HR=0 THEN HRS="L" ELSE HRES=" H* 
IF VB=0 THEN VES="L" ELSE VE¢=" He 
PRINT CP,CA,HERS, VES ,MA 

RETURN 


PRINT 
PRINT CP,CA,R1i0, TENC,MA 
RETURN 


PRINT 
PRINT CP,R10,F10(SP10) ,SP10,FC10 
PRINT CA,MUX,CC,CCEN,MA 


PRINT TENC,CN4,F11,HB,VB 


RETURN 

REM MAIN ROUTINE 

REM 

GOSUB 4000 REM CLOCK 
REM FETCH MICROCODE 


ON MA+1 GOSUB 30,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22 
GOSUB 2500 REM 29111. 
GOSUR 3000 REM 2911H 

Figure B1 (Cont.) 
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REM 


REM 
3100 


KEM 
REM 
3200 


CA=Y11%16+L4B REM CHARACTER ADTRESS 
REM COMPARATOR NEXT 
IF Y11=120 AND TENTH=0 \ REM REV & 


THEN COMF=0 \ 
ELSE COMP =1 


GOSUB 2000 REM MUX 
GOSUR 1000 REM 2910 
REV 6 


IF WFEILES="0" THEN GOSUB 7000 \ REM DIRECT PRINTOUT 
ELSE PRINT #1;CP,R10,F10(SP10),SP10,FC10,CA,MUX, \ 
CC,CCEN,MA, TENC,CN4,Fil,HE,VB 


IF CONTROLS="B* THEN INFUT SS REM SINGLE STEF 
REM CHECK END OF RUN 

GOSUBR S200 REM ESCAPE CREV 7) 

TF S=155 THEN PRINTSPRINT "AKRORTED AT "5CP s GOTO 5100 
IF CP<CPM THEN GOTO S000 REM REFEAT MAIN 
IF WFILES NE "O° THEN CLOSE (1) 

OUT 100,12 REM PRINTER FAGE EJECT (REV 7) 

GOTQ 9100 

3200 SUB REY 7 

REM ESCAPE SUBROUTINE 

S=INF (C97) 


S=INT(S/2) 

$=§/2-INT(S/2) 

IF S NE O THEN S = INF(96) 
RETURN 


REM INITIALIZATION 

PRINT 

SPLO=1 

FRINT "MA= "5MA 

INFUT "NEW MA CY OR N)"35%9 

IF SS$="N* THEN GOTO 5510 

INPUT "MA=(O<=MAZ 22) "sMA 

MA=INT CMA) 

IF MA<O OR MAP21 \ 

THEN PRINT MAs;* IS ILLEGAL® :\ 

GOTO SS0S 

IF MA=0 THEN TENC=0 s HB=1 s TENTH=1 


PRINT 
PRINT "VECTOR= "sVECTOR 
INPUT "NEW VECTOR (CY OR ND" 55% 
IF SS="N" THEN GOTO S520 
INPUT “VECTOR=(0<=VECTOR<120) "3 VECTOR 
VECTOR=INTCVECTOR) 
IF VECTOR<0 OR VECTOR?119 \ 
THEN FRINT VECTOR;" IS ILLEGAL" s\ 
GOTO S515 


PRINT 

FRINT “CFr= °3CP 

INPUT "NEW CF CY OR ND "3S 

IF S$="N* THEN GOTO 3530 

TNFUT "CRCS=0)= "3CF 

CR=INT CCF) 

IF CP<0O THEN PRINT CFs" IS ILLEGAL" =: GOTO 3525 


PRINT 

PRINT "CPM= "sCFM 

INPUT "NEW CPM (CY OR N)°55S 
IF S$="N" THEN GOTO 53540 
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REM 
JIAO 


REM 
IvOO 


REM 
REM 
REM 
30 


REM 


Be 


REM 


REM 


INPUT *"CPM=(CP+1<CPM) * sCPM 
CPM=INTCCPM) 
IF CPM<CF+i THEN PRINT CPM3"* IS ILLEGAL" ;"CP= "sCF sGOTO 3535 


PRINT 

PRINT "TENC= "3 TENC 

IF MA=0 THEN GOTO 3550 

INPUT "NEW TENC (CY OR N)D°3S% 

IF SS="N" THEN GOTO 3550 

INPUT "TENC=(O<=TENC<10)" 3 TENC 

TENC=INTCTENC) 

IF TENC=0 OR TENC?9 \ 

THEN FRINT TENC;" IS ILLEGAL” =:\ 

GOTO S545 

IF TENC=9 THEN TENTH=0 ELSE TENTH=1 


PRINT 

PRINT "Ri0= "R10 

INFUT "NEW R1LO (CY OR N)"3S$ 

IF S$="N" THEN GOTO 5540 

INPUT "R10 (O<=R10<25)="sR10 

RLO=INTCRIO) 

IF Ri0O<O OR R10224 THEN PRINT R1i0;" IS ILLEGAL =: GOTO 5555 


REM 
RETURN 


T10=6 
CCEN=0 
MUX=3 
S$li=3 
FE=Q 
ZEROH=1 
ZEROL=0 
CN=0 

MR 1 REM REY 2 
VB=0 
FL=QO 
RETURN 


110=12 

$11=0 

FE=1 

ZEROH=1 

ZEROL=0 

CN=0 

HE=1 REM REV 2 
VB=0 

PLe23 

RETURN 


110=14 
Sii=2 

FE=4 

ZEROH=1 

ZEROL=0 

CN=1 

HB=1 REM REV 2 
VE=0 

RETURN 


110=3 
Figure B1 (Cont.) 
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REM 


& 


REM 


REM 


CCEN=0 
MUX=1 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
HE=0 
VB=Q 
PLS 
RETURN 


T10=3 
CCEN=0 
MUX= 1. 
$11=0 
Fis) 
ZEROH=1 
ZEROL=1 
CN=1 
HR=0 
VE=Q 
Fil=4 
RETURN 


110=3 
CCEN=0 
MUX=1 
S11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
HE=0 
VE=O 
Plas 
RETURN 


I110=3 
CCEN=0 
MUX=1, 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
HR=0 
VB=0 
FL=6 
RETURN 


T10=3 
CCEN=0 
MUX=1 
S110 
FE= 1 
ZEROH=1 
ZEROL=1 
CN=1 
HR=O 
VB=Q0 
FL=7 
RETURN 


Figure B1 (Cont.) 


REM 
10 


REM 


12 


REM 
13 


REM 
14 


T10=1 
CCEN=0 
MUX=0 
$11=0 
FE] 
ZEROH=1 
ZEROL=1 
CN= 1. 


GOSUB SO REM TENC 


VB=0 
PL=12 
RETURN 


T1i0=1 
CCEN=0 
MUX=2 
$11=0 
Fi=] 
ZEROH=1 
ZEROL=1 
CN] 
GOSUB SO 
VE=O 
PLe21 
RETURN 


L19=3 
CCEN=0 
MUX=1. 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
GOSUB SO 
VB=0 
FL=190 
RETURN 


110=3 
CCEN=1 
$i1=0 
FE 1 
ZEROH=1 
GOSUB 5O 
VE=0 
FL. 
RETURN 


T10=9 
$11=0 
FE=0 
ZEROH=1 
ZEROL=1 
CN=1 
GOSUB 50 
VB=0 
FL=20 
RETURN 


T10=6 
CCEN=0 
MUX=3 
$i1=3 


REM 


15 


REM 


REM 
16 


REM 
17 


REM 
18 


REM 


REM 
20 


FE=O REM REV 10 
ZEROH=1 

ZEROL=0 

GOSUB 50 

VE=] 

RETURN 


110=12 

$11=90 REM REV 10 

FE=1 REM REV 10 

ZEROH=1 

ZEROH=1 REM REMOVED REV 10 
GOSUB 50 

VB=1 

PL=119 

RETURN 


T10=4 
CCEN=0 
MUX=3 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
GOSUB 50 
VB=1 
RETURN 


110=3 
CCEN=0 
MUX=1 
S11=0 
FE=1 
ZEROH=1 
ZEROL=1. 
CN=1 
GOSUE 5O 
VB=1 
PL=1é 
RETURN 


T10=8 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
GOSUB 50 
VB=1 
RETURN 


T10=12 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
GOSUB 50 
VB=1 
PLe23 
RETURN 


110=10 
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CCEN=1 
Si1=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=4 
GOSUB 50 
VB=4 
RETURN 
REM 
24 110=10 
CCEN=1 
Si1=0 
FE=4 
ZEROH=1 
ZEROL=1. 
CN=1 
GOSUR 50 
VB=O 
RETURN 
REM 
22 110=10 
CCEN=1 
FExO REM REV 9 
ZEROH=0 
ZEROL=1 REM REV 9 
CN=4 
GOSUB 50 
VE=O 
RETURN 
REM 
50 REM TEN-LINE-COUNTER CLOCKING SUBROUTINE 
IF HB=1 THEN RETURN 
HE= 1 
TENC=TENC+1 
IF TENC=9 THEN TENTH=0 ELSE TENTH=1 
IF TENC=10 THEN TENC=0 
RETURN 
REM PUSH AND POF SUBROUTINES REMOVED REV 3 
1000 REM 2910 INSTRUCTIONS SUBROUTINE 
ON I10+1 GOTO 1100,1110,1120,1130,1140,1150,1160,1170,1180, \ 
1190,1200,1210,1220,1230,1240,1250 
REM 3 
1100 REM JZ 
MA=0 REM 2910 Y 
SP10=0 REM 2910 STACK POINTER (=0 REV 3) 
RETURN 
REM 
1110 REM CUS 
IF FNFAIL \ 
THEN MA=PC1O \ 
ELSE MASPL 2\ 
PUSH=1 REM REV 3 
RETURN 
REM 
1120 REM JMAP 
PRINT "JMAP NOT PROGRAMMED" 
RETURN 
REM 
1130 REM CUP 
IF FNFAIL \ 
THEN MA=PC10 \ 
ELSE MASPL 


RETURN Figure B1 (Cont.) 


REM 
1140 


REM 


1150 


REM 
1160 


REM 
1170 


REM 
1180 


REM 
1190 


REM 
1200 


REM 
1210 


REM 
1220 


REM 
1230 


REM 
1240 


REM PUSH 
IF FNFASS THEN R10=PL REM 
MA=FC10 
PUSH=1 REM REV 3 
RETURN 
REM JSRP 
PRINT "JSRP NOT PROGRAMMED" 
RETURN 
REM CIV 
IF FNFAIL \ 
THEN MA=PC10 \ 
ELSE MA=VECTOR 
RETURN 
KEM JRF 
IF FNFAIL \ 
THEN MA=RK10 \ 
ELSE MA=F'L. 
RETURN 
REM RFCT 
IF Ri0=0 \ 
THEN MA=PC10 =:\ 
POF=1 \ 
ELSE MA=FLIOCSP10) =\ 
Ril0=R10-1 
RETURN 
REM RPCT 
IF R10=0 \ 
THEN MA=PC10 \ 
ELSE MA=FL s\ 
Ri0O=R10-1 
RETURN 
REM CRTN 
IF FNFAIL \N 
THEN MA=PC10 \ 
ELSE MA=FLO(SP10) <\ 
PORP=1 REM 
RETURN 
REM CJPP 
PRINT "CJPP NOT PROGRAMMED" 
RETURN 
REM LDCT 
R10=PL. 
MA=PC10 
RETURN 
REM LOOF 
IF FNFAIL N 
THEN MA=F10(SP10) \ 
ELSE MA=F'C10 s\ 
FOP=1 REM REV 
RETURN 
REM CONT 
MA=PC10 
RETURN Figure B1. (Cont.) 


LOAD COUNTER 


REY 3 


3 


76 


REM 
1250 


REM 


REM 
2000 


REM 
2100 


REM 
2200 


REM 
2300 


REM 
2400 


REM 
REM 
2300 


REM 
REM 
REM 
REM 
3000 


REM 
31900 


REM 
3200 


REM 
3300 


REM 
3400 


REM 
REM 
4000 
REM 


REM 


REM TWE 

PRINT "TWB NOT PROGRAMMED" 
RETURN 

REM MUX SUBROUTINE 


ON MUX+1 GOTO 2100, 2200, 2300, 2400 


IF TENTH=0 \ 
THEN CC=0 \ 
ELSE CC=) 
RETURN 


IF CN4=0 \ 
THEN CC=0 N 
ELSE CC=1 
RETURN 


IF COMP=0 \N 
THEN CC=0 \ 
ELSE CC=1 


RETURN 


CC=] 
RETURN 


REM LEAST SIGNIFICANT 2911 (2911L) SUBROUTINE 
IF ZEROL=0 THEN L4B=0 
RETURN 


REM MORE SIGNIFICANT 29115 (2911H) SUBROUTINE 
ON $11+1 GOSUB 3100, 3200,3300,3400 

IF ZEROH=0 THEN Yii=0 

RETURN 


Yi1=PC11 
RETURN 


Y1I1=R11 
RETURN 


YiLl=F1i1 
RETURN 


IF T10=6 \ 
THEN YI1=VECTOR \ 
ELSE Y11=PL. 


RETURN 
REM CLOCK SUBROUTINE 
FPC10=MA+1 REMOVED REV 4 


IF CN=1 THEN L4B=L4B+1 
IF L4B>15 THEN L4B=0 s CN4=1 ELSE CN4=0 
IF CN4=1 \N 

THEN FCLL=Yiilet+l \ 

ELSE FC11=Y11 
IF FE=0 THEN Fii=PCil 
<--KEV 3 

Figure B1 (Cont.) 


REM 


REM 
REM 


IF FPUSH=1 \ 
THEN 


IF SFIO>4 N 
THEN 


IF FOP=1 \ 
THEN 


IF SP10<0 \ 
THEN 


REV 3 --> 
PCLO=MA+1 
CP=CF+1 
RETURN 


SPLO=SP10¢+1 =s\ 
FIOCSF10>=PC10 s\ 
FUSH=0 


PRINT "2910 STACK FULL " s\ 
SFLO=3 


SFPLO=SP10—-1 =\ 
POF =0 


PRINT "POP EMPTY FILE? "3CP s\ 
SF1LO=0 


REM REV 4 


Figure B1 (Cont.) 
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APPENDIX C 


A simple circuit was designed to accommodate five different 
display formats and also to comply with the European 50Hz TV 
standard. Figure C1 !s the circuit diagram of this additional circuit. 


The following parameters change when the format is changed: 

1) The number of characters/line. 

2) The number of lines/frame. 

3) The number of characters to display (1.e., the address of the 
last character). 

4) The line frequency and therefore the dot frequency. 


The number of characters/line is counted by the least significant 
Am2911 sequencer via the microcode. Therefore, the microcode 
can be changed to change the number of characters/line. The 
number of lines/frame is counted by a constant, loaded into the 


133 2 


U24 
1/2 Am25LS240 
wet ss 
7 6 5 4 3 2 1 «0 


B 
US 
Am25LS2521 


FORMAT 


Am2910 internal counter by the microcode. The microcode can 
be changed to vary the number of lines/frame. 


The scan is reinitialized to zero when the last address +1 is 
attained. Ug (Am25LS2521) detects this address by comparing 
bits A, through Ajo of the character address bus to a constant 
supplied to its B inputs. A table listing these constants is shown in 
Figure C1. By setting the DIP switches according to that table, the 
character scan will reinitialize correctly. The same constant is 
routed through one half of an Am25LS240 (U24) to the internal 
data bus. At microprogram address zero, a JUMP MAP instruc- 
tion enables these outputs thereby putting a starting address on 
the bus according to the table in Figure C1. 


The microprogram is shown on Figure C2. 


CONNECT : 
FOR 50 Hz 1Cg 1C, 1Cp 1C3 
1G 
U41 


Am2SLS153 y¢ 


MPR-501 


LAST COMPARE AT MAP 
CHAR. ADD. +1 | LAST ADD/16 | S3; S2 S; Sg | ADDRESS DOT FREQ. (MHz) 
78H | H H H H OFO 


10.86624 
9.09216 
5.544 
5.376 
3.65568 





Figure C1. 
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AD>TYPE CRT.DEF 


3 

sCkT DEFINITION FILE 
;BY MCSHEE M. SHAVIT 
SREV @ 3/8/78 


’ 


LITLE CRT CONTRCLIER --DEFINITIONS 


WORD 24 

b] 

FE: DEF 1VE#1 ,23Xx 

ZEROH: TEF 1X, ,1VE#1,22x 

Sli DEF ZX ,cV4E3Q#, EOX 

I12: DEF 4X,4VH#,1EX 

CN: DEF OX,1VBR1,14% 

ZERCL: DEF 10X,1VB#1,13X 

VE; DEF 11X,i1VBa2,12XxX 

ERE: DEF 12X,1VBH#0,11X 

CCEN: DEF 13X,1VE#,10% 

MUX: DEF 14X,B#e0,8X 

MUX1: LEF 14X,B#12,8X 

MUXE: DEF 14X,B#@1,8X 

MUXSs DEF 14X,B#11,8X 

Pls DEF TEX.SVZ5 

9 

Le EQU R#D 

Ks: KQU BHI 

9 

CCUNT: DEF be1, BAHL, BAZ, 5X, BH1, BH1, RC, BAG ,1X,2X,8% 
COUKTE: DEF BH1,BH1, BHO? ,5X ,FH#1,BH1, BHC, BH1,1X,2xX, 8X 
COUNTV: LEP B#1,B#1, BHO ,5X,b4#1,BH#1,5b#1,2#1,1%,2X,8% 
9 

FAL 

A> 


Figure C2. AMDASM Definition and Assembly Files for the CRT Controller. 
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AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
CRT CONTROLLER 


BOLLS 


S221 
BOB2 


BOOS 
0204 
O2S5 
OBOE 
0207 
ZBL 
GOO 
ZOOA 
COCKE 
BBC 
OSD 
ACK 
OGOE 


COCK 
@Z1¢ 
@G11 
QW12 
Q@013 
@014 


@015 
9@16 


0917 
@018 


C019 
OO1A 
@01B 
O21C 
@G1D 
OG1E 
OG1F 
BO22 
O21 
QG22 
ACK 
B82 


O224 
O25 
OO26 
O02 


;CFT CONTRCLLER MICROFROGRAY 


$ 
+EY MOSHE M. ShAVIT 
sREV 2 5/3/7E& 


2482: 


11d 


H#2 


24 ROWS 


118 


H#E 


/ CN L & HB EB 


/VB & 
M2480: 


T2486: 


112 


H#C 


PL D#25 


110 
11 
I1¢ 
110 
11 
118 
110 
11 
112 
110 
112 


110 


H#E 
H#3 
H#3 
H#3 
H#3 
H#3 
H#1 
H#1 
H#3 
H#S 
H#9 


H#E 


/ HEH & VB B 


’ 
GOBACK: 
LASTAS: 


y 
; 
} 
’ 
; 
$2464: 

/ CN 


119 
110 
11é 
110 
11@ 
112 


110 
118 


HAC 
H#e4 
H#3 
H#8 
H#C 
H#A 


H#A 
H#A 


24 ROWS 


110 


H#6 


L & HB EH 


T1@ 


H#C 


/VB & PL D#23 


M2464: 


T2464: 


110 
1108 
119 
110 
11 
110 
110 
1168 
118 
T1@ 


118 


H#k 
H#S 
H#3 
H#3 
H#3 
H#1 
H#¥1 
H#3 
H#3 
H#9 


H#6 


/ HBH & VB H 
118 H#C 
I1@ Hr4 
Ii@ H#3 
I1@ H#8 


3JUMP MAP 


8@ CHARACTERS 68 F/S 


VB 


CCEN 
CCEN 
CCEN 
CCEN 
CCEN 
CCEN 
CCEN 
CCEN 
CCEN 


CCEN 
CCEN 


PMMMMM MM MMM MMOMMMMO MMM 


CCEN 


& CCEN 
& CCEN 


Meer eee 
MM OM 1 MOM! MO 


L 
L 


COUNTV 
COUNTY 


H 
H 


& 
& 
& 
& 
& 
& 


MUX1 
MUX1 
MUX1 
MUX1 
MUX1 
MUX 
MUX2 
MUX1 


MM MMM MM M 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEFROL L& 
511 @ & FE & ZEROH & ZERCL L & CNL & HE EH & 
S11 2 & FE & ZEROH & ZEROL L & CN & HB EH & VB 


COUNT & PL 
COUNT & PL 
COUNT & PL 
COUNT & PL 
COUNT & PL 
COUNTH & PL T2482 
COUNTH & PL LASTA 
COUNTH & PL $ 


ARASH A 


511 @ & FE & ZEROH & HB EH & VB & PL M248 


Si1 @ & FE L & ZEROH & ZEROL & CN H & HB HEH & VB & PL GOR 
CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 
S11 @ & FE & ZEROH & HE EH & VB H & PL D#146 


MUX3 & COUNTV 
MUX1 & COUNTV & PL $ 


PL D#235 


COUNTV 


COUNTH 
FE L & ZERCH L & ZEROL & CN H & EB EH & VB 


64 CHARACTERS 6@ F/S 


VB 


CCEN 
CCEN 
CCEN 
CCEN 
CCEN 
CCEN 
CCEN 
CCEN 


MMMM M MM MMM MMM MOM > 9? & 


Mette et eee 
MAMMMMMM 


MUX1 


& 


MUX1 & 


MUX1 
MUX1 
MUX 
MUX2 


MUX1 | 


& 
& 
& 
& 
& 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 
511 @ & FE & ZEROH & ZEROL L & CNL S& HE EH& 
S11 2 & FE & ZEROH & ZEROL L & CN & HB H & VB 


COUNT & PL 
COUNT & PL 
COUNT & PL 
COUNT & PL 
COUNTH & PL T2464 
COUNTH & PL LASTA 
COUNTH & PL $ 


MS 


S11 @& FE & ZEROH & HB H & VB & PL M24€4 


S11 @ & FE L & ZFROH & ZEROL & CN H & HB H & VB & PL GOB 
CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 


S11 @ & FE & ZEROH & HB BH & VB EH & PL D#122 
CCEN L & MUX3 & COUNTV 

CCEN L & MUX1 & COUNTV & PL $ 

COUNTV 


Figure C2 (Cont.) 


AMDOS/29 AMDASM MICRO 
CRT CONTROLLER 


SOLE 
2029 


OLA 
OC2B 


GO2C 
@620 
CO2E 
@e2F 
S030 
OO31 
CCS52 
9833 
ACK 
QO34 


@235 
OLOSE 
@237 
OG28 
@B3o 
GOSA 


OOSE 
GB3C 


OSD 
QO3E 
OOSF 
Q04¢ 
Q241 
2O42 
C043 
0044 
ACK 
@845 


QB4E 
C247 
9048 
BB49 
O24A 
C045 
Q84C 
024D 
QO4E 
O24F 


118 
116 


GC) we MWe We we We WE 


2432: 1198 


H&#C 
H#A 


24 ROWS 


h#6 


/ CN LS& HB E 


118 


HAC 


/VB & PL D#2e 


M245e3 112 
110 
11d 
110 
11 
112 
11¢ 

T2432: 112 


11¢ 


H#E 
H#3 
H#3 
H#i 
H#1 
H#3 
H#3 
H#9 


H#E 


/ HB EH & VB HY 


112 
11 
T1108 
118 
112 
112 


Td we we we we we 


1€32: 110 


H#C 
H#4 
H#3 
H#8& 
H#C 
H#A 


16 ROWS 


H#6 


/ CN L & HB B 


11 


HAC 


/VB S PL D#15 


M1632: 110 
112 
11¢ 
110 
112 
119 
I1@ 

T1€352: 110 


112 


H#E 
H#3 
H#3 
H#1 
H#1 
H#3 
H#3 
H#9 


K#6 


/ HB EH & VB SE 


11 
11 
11 
118 
I1@ 
119 
110 
11 
11 
110 


@We we We we we 


H#C 
H#4 
H#3 
H#8 
H#C 
H#4 
H#2 
H#e 
H#C 
H#A 


16 ROWS 
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ASSEMBLER, V1.1 


& 
& 


COUNTV & PL D#23 
CCEN H & COUNTY 


S2 CHARACTERS 6@ F/S 


PM MM OOH a) MMOMMMMMM MMM 


CCEN L & MUX3 & S115 & FE L & ZEROH & ZEROL L & 
VB 
S11 @ & FE & ZERCH & ZEROL L & CNL & BB HS& 


S1i 26 FE & ZEROH & ZEROL L & CN & HEE & VB 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUXO & COUNTH & PL T2432 

CCEN L & MUX2 & COUNTH & PL LASTA 

CCEN L & MUX1 & COUNTH §& PL $ 

CCEN H & S11 © & FE & ZEROH & HB H & VB & PL M2432 

511 @ & FE L & ZEROH & ZEROL & CN H & HB EH & VB & PL GOB 


CCEN L & MUX3 & S11 3 & FE L & ZEROH & ZEROL L & 


S11 6 & FE & ZEROH & HE H & VB EH & PL D#74 
CCEN L & MUX3 & CCUNTV 

CCEN L & MUX1 & COUNTV & PL $ 

COUNTY 

COUNTV & PL D#23 

CCEN E & COUNTY 


S2 CHARACTERS 60 F/S 


MMMM MMO AMMA M MMM MMM MM MO OH 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 
VB 
S11 ®@ & FE & ZEROH & ZFROL L & CNL & HERES 


S11 26& FE & ZEROH & ZEROL L & CN & HB OE & VB 


CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX® & COUNTH & PL 11632 

CCEN L & MUX2 & COUNTH & PL LASTA 

CCEN L & MUX1 & COUNTH & PL $ 

CCEN H & S11 @ & FE & ZEROH & HB H & VB & PL M1632 


511 @ & FE L & ZEROH & ZEROL & CN BH & HB EH & VB & PL GOB 
CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L &. 


S11 ®@& FE & ZEROH & HB H & VB HE & PL D#25@ 
CCEN L & MUX3 & COUNTY 

CCEN L & MUX1 & COUNTV & PL $ 

COUNTY 

COUNTV & PL D#48 

CCEN L & MUX3 & COUNTY 

CCEN L & MUX1 & COUNTV & PL $ 

COUNTV 

COUNTV & PL D#15 

CCEN H & COUNTY 


16 CHARACTERS 6@ F/S 


Figure C2 (Cont.) 


AMDOS/Z9 AMDASM MICRO ASSEMBLER, V1.1 
CRT CONTROLLER 


@5@ $1616: 110 H¥E6 & CCEN L & MUX3 & S11 3 & FE L & ZEROH & ZEROL L & 
/ CN L & HB E & VB 
@051 119 H#C & $11 09 & FE & ZEROH & ZEROL L & CN L & HB HE & 
/VB & PL D#15 
@@52 M1€16: 110 H¥E & S11 2& FE & ZEROH & ZEROL L & CN & HB H & VB 
8053 119 H#3 & CCEN L & MUX1 & COUNT & PL $ 
6054 11@ H#1 & CCEN L & MUX® & COUNTH & PL T1616 
@O55 11@ H#1 & CCEN L & MUX2 & COUNTH & PL LASTA 
ZO5E 11g H#3 & CCEN L & MUX1 & COUNTH & PL $ 
GO5% 119 H#Z & CCEN H & S11 @ & FE & ZEROH & HB EH & VB & PL M1616 
@@5@ T1616: 110 H¥9 & $11 @& FE L & ZEROH & ZEROL & CN H & HB E & VB & PL GOB 
ACK 
@059 1i@ H¥6 & CCEN L & MUX2 & S511 3 & FEL & ZEROH & ZEROL L & 
/ HB H & VB H 
OOSA 119 H#¥C & S11 @ & FE & ZEROH & HE HE & VB EH & PL D#203 
OO5B 110 H#4 & CCEN L & MUXS & COUNTY 
Z25C 110 H#3 & CCEN L & MUX1 & COUNTV & PL $ 
@95D Tig H#8 & COUNTY 
QO5E 11@ H#C & COUNTV & PL D#15 
GO5¥F 119 H#A & CCEN H & COUNTY 
9 
OOFE ORG HA GEO 3 24*BD 
GOFC Tig H#¥3 & CCEN H & PL S248 
9 
’ 
GOOFS ORG H#OFS 3 24¥*64 
GOOFS I1@ H#3 & CCEN H & PL 82464 
’ 
9 
GOFO ORG H#SFO 3 24*352 
OFC 119 H#3 & CCEN H & PL S2432 
’ 
’ 
GOYE ORG H#AOFB 7 16*32 
QOOFB I1@ H#3 & CCEN H & PL $1632 
’ 
’ 
@OFD ORG H#OFD + 16*16 
@O¥FD 11@ .H#5 & CCEN H & PL 81616 


5@ F/S ROUTINES 


we We we we We 


G10 ORG H#100 
24 ROWS 8@ CHARACTERS 5@ F/S 


CA we we we we 


@1@@ S$248@E: 110 H#¥E & CCEN L & MUX3 & S11 3 & FE L & ZEROH & ZEROL L & 
/ CNL & HB ES& VB 

0121 11@ H#C & S11 @ & FE & ZEROH & ZEROL L & CNL & HB ES&S 
/VB & PL D#23 

@1@2 M248@E: 11@ H#E & S11 2 & FE & ZEROH & ZEROL L & CN & HB H & VB 

102 119 H#3 & CCEN L & MUX1 & COUNT & PL $ 

9104 119 H#3 & CCEN L & MUX1 & COUNT & PL $ 

9105 11@ H#S & CCEN L & MUX1 & COUNT & PL $ 

9106 110 B#¥3 & CCEN L & MUX1 & COUNT & PL $ 

2107 I11@ H#3 & CCEN L & MUX1 & COUNT & PL $ 

9128 110 H#1 & CCEN L & MUX® & COUNTH & PL T2480E 

@1e9 11@ H#1 & CCEN L & MUX2 & COUNTH & PL LASTA 

Q10A 110 H¥3 & CCEN L & MUX1 & COUNTH & PL $ 

G10B 110 H#¥3 & CCEN H & S11 @ & FE & ZEROH & HB H & VB & PL M2480E 

G1@C T246@E: 110 H¥° & S11 @& FEL & ZEROH & ZEROL & CN H & HB E & VB & PL GOB 

ACK 


Figure C2 (Cont.) 
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AMDOS/29 AMDASM MICRO ASSEMBLER, Vi.1 
CRT CONTROLLER 


G19D 110 H#E 
/ HB H & VB EB 
210k 11i@ H#C 
201 TIMES 
G10F I1¢@ H#4 
G11 T1@ H#3 
@111 118 H#8 
9 
G11e2 118 H#C 
9113 110 H#4 
0114 11g H#S 
@115 110 H#8 
’ 
0116 11@ Hac 
g117 110 H#A 
3 
3 
; 24 ROWS 
j 
’ 
9118 S2464E: 110 H#E 
/ CN L & HB # 
g119 I1@ H#C 
/¥VB & PL D#23 
Q11A M2464E: 118 H#E 
@118 11@ H#S 
@11C lig H#S 
G11D 11@ H#3 
G11E 1i@ H#3 
@1iF I1@ H#1 
G126 I1@ H#1 
@i21 119 H#3 
@122 11g H#3 
@122 T24E4E: 110 HHS 
ACK 
G124 I1g H#E 
/ HB H & VB 8 
125 I1@ Hac 
G12E 110 H#4 
@127 118 H#3 
@12& I11@ H#8 
g 
@129 Tig Hac 
O1ZA I1@ H#4 
Z12B 11d H#S 
Z12C 118 H#8 
9 
@12D 118 H#C 
O1l2k 11@ H#A 
3 
j 
’ 
’ 24 ROWS 
bd 
g 
@12F S2452E: 110 H#E 
/ CN L & HB SE 
0136 I1@ H#C 
/VB & PL D#23 
151 M2432E: 110 H#E 
B1zzZ 110 H#S 
0133 118 H#S 
134 I1@ H#1 
8135 110 H#1 
8136 I1@ H#3 


> — MMMM MOM M— o> = &} 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 
S11 @ & FE & ZEROH & HB H & VB H & PL D#200 ;ITERATES 


CCEN L & MUXS & COUNTY 
CCEN L & MUX1 & COUNTV & PL $ 
COUNTY 


COUNTV & PL D#239 

CCEN L & MUXS & COUNTY 

CCEN L & MUX1 & COUNTV & PL $ 
COUNTY 


COUNTV & PL D#23 
CCEN H & COUNTY 


€4 CHARACTERS 56 F/S 


& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 
VB 
Si1 @ & FE & ZEROH & ZEROL L & CN L & HB HE & 


S11 26 FE & ZEROH & ZEROL L & CN & HB H & VB 


CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX® & COUNTH & PL T2464E 

CCEN L & MUX2 & COUNTH & PL LASTA 

CCEN L & MUX1 & COUNTH & PL $ 

CCEN H & $11 @ & FE & ZERCH & HB HE & VB & PL M246€4E 


S11 ®@& FE L & ZEROH & ZEROL & CN H & HB H & VB & PL GOB 
CCEN L & MUX3 & S11 3 & FE L & ZEROH & ZEROL L & 


S11 @6& FE & ZEROH & HB H & VB H & PL D#200 
CCEN L & MUX3 & COUNTY 

CCEN L & MUX1 & COUNTV & PL $ 

COUNTY 


COUNTV & PL D#167 3 5E9 
CCEN L & MUXS & COUNTY 

CCEN L & MUX1 & COUNTV & PL $ 
COUNTY 


COUNTV & PL D#23 
CCEN H & COUNTY 


S32 CHARACTERS 50 F/S 


MMOMMMOM MMM 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 
VB 
S11 ®@ & FE & ZEROH & ZEROL L & CNL & HB ES 


S11 2 & FE & ZEROH & ZEROL L & CN & HB AE & VB 
CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX@ & COUNTH & PL T2432E 

CCEX L & MUX2 & COUNTH & PL LASTA 

CCEN L & MUX1 & COUNTH & PL $ 


Figure C2 (Cont.) 
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AMDOS/29 AMDASM MICRO 
CRT CONTROLLER 


81357 

8138 
ACK 

91359 


@13A 
@13B 
@13C 
G13D 
@13E 
@13F 


2142 
0141 


0142 
@142 
$144 
@145 
146 
0147 
@146 
@149 
ACK 
O144A 


G14 
@14C 
@14D 
G14E 
@14F 
8152 
@151 
G152 
@153 
@154 


8155 
@156 


8157 
2158 
@159 
@1i5A 
@15B 
@15C 
@15D 
ACK 
@15E 


@15F 
6160 
6161 
162 


110 
T2432E: 118 


110 


H#3 
H#9 


H#E 


/ HB H & VB § 


112 
112 
T1¢ 
118 
110 
11@ 


1E6S2E: 110 


H#C 
H#4 
H#2 
H#8 
H#C 
H#A 


16 RQWS 


H#E 


/ CN L & HB E 


118 


HAC 


/VB & PL [#15 


MIES2E: 110 
112 
112 
I1¢ 
110 
11 
11d 
T1632E: 116 


110 


H#E 
H#S 
H#3 
H#1 
H#1 
H#3 
H#3 
H#¢° 


HH#E 


/HBEH & VB HE 


11@ 
11d 
112 
11¢ 
118 
110 
110 
110 
110 
116 


Ti we we we we we 


1€16E: 112 


H#C 
H#4 
H#3 
H#6& 
H#C 
H#4 
H#3S 
H#8 
H#C 
H#A 


16 ROWS 


H#6 


/ CN L & HB HE 


118 


H#C 


/VB & PL D#15 


MI€1EE: 110 
11 
118 
112 
110 
119 
TI61EE: 112 


11a 


H#E 
H#S 
H#1 
H#1 
H#3 
H#3 
H#9 


H#6 


/ HB EH & VB H 


110 
Ti@ 
110 
11@ 


H#C 
H#4 
H#23 
H#E 


ASSEMBLER, V1.1 


& 
& 


& 
& 


& 
& 
& 
& 
& 


CCEN H & S11 @ & FE & ZEROH & HB EH & VB & PL M2432E 
$11 @6& FE L & ZEROH & ZEROL & CN H & HBH & VB & PL GOB 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 


S11 @& FE & ZEROH & HB HE & VB E & PL D#224 
CCEN L & MUX3 & COUNTY 

CCEN L & MUX1 & COUNTV & PL $ 

COUNTY 

COUNTV & PL D#23 

CCEN H & COUNTY 


32 CHARACTERS £@ F/S 


PMMPMMMMOM MM MM MMMM MMMN MOM 


CCEN L & MUX3 & S11 3 & FE L & ZEROH & ZEROL L & 
VB 
Sil @ & FE & ZEROH & ZEROL L & CNL & HB EHS& 


S11 2 & FE & ZEROH & ZEROL L & CN & HB H & VB 
CCEN L & MUX1 & COUNT & PL § 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX® & COUNTH & PL T1632E 

CCEN L & MUX2 & COUNTH & PL LASTA 

CCEN L & MUX1 & COUNTH & PL $ 

CCEN H & S11 @ & FE & ZEROH & HB H & VB & PL M1632E 

S11 @ & FE L & ZEROH & ZEROL & CN H & HB H & VE & PL GOB 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 


Si1 @ & FE & ZEROH & HB H & VB H & PL D#25@ 
CCEN L & MUX3 & COUNTY 

CCEN L & MUX1 & COUNTV & PL $ 

COUNTY 
COUNTV & PL D#223 5475 

CCEN L & MUX3 & COUNTY 

CCEN L & MUX1 & COUNTV & PL $ 
COUNTY 
COUNTY & 
CCEN H & 


PL D#15 
COUNTY 


16 CHARACTERS 5@ F/S 


& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 

& 
& 

& 
& 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 
VB 
S11 @ & FE & ZEROH & ZEROL L & CNL & HBE& 


S11 2 & FE & ZEROH & ZEROL L & CN & HB HE & VB 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX® & COUNTH & PL 1T1616E 

CCEN L & MUX2 & COUNTH & PL LASTA 

CCEN L & MUX1 & COUNTH & PL $ 

CCEN H & $11 @ & FE & ZEROH & HB H & VB & PL M1616E 

S11 @& FE L & ZEROH & ZEROL & CN H & HB E & VB & PL GOB 


CCEN L & MUXS & S11 3 & FE L & ZEROH & ZEROL L & 


Si1 @ & FE & ZEROH & HB H & VB H & PL D#200 
CCEN L & MUX3 & COUNTY 

CCEN L & MUX1 & COUNTY & PL $ 

COUNTV 
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AMTOS/29 


AMDASM MICRO 


CRT CONTROLLER 


e 
9 


ASSEMBLER, Vi.1 


7OLO 


@1291001X1181XxXX 
@1110119XX@11¢11 
11001100XXX11XXX 
11900100X11119011 
119699011X1111010 
119@1000X1111XXX 
11001100X1111XXxX 
11001010X11111XxX 
€111801189X8801911 
1190110@XOOG1XxXX 
11101110X1001XXX 
1102¢@011X1199010 
11020011X1120010 
11000201X1191020 
11960001X1101001 
11960011X1101910 
11009011XXXG11XxX 
G1961001X1101XxXxX 
01112110XX011011 
110011@00XXX11XXX 
11809100X1111011 
119@8611X1111019 
11991900X1111XXX 
11@81109X1111XXX 
1100101@X11111Xx 
$1119110X9691911 
11981180XO0G1XXX 
11101110X1@@1iXXxxX 
11808011X1190910 
11089011X119661@ 
1190800 1X11810008 
11990001X1181601 
11069011X118101¢6 


8163 1106 H#C & COUNTV & PL D#121 
0164 I11@ H#4 & CCEN L & MUXS & COUNTY 
Q165 110 H#3 & CCEN L & MUX1 & COUNTV & PL $ 
0166 110 H#8 & COUNTV 
9 
@1E% 11@ H#C & COUNTV & PL D#15 
0168 110 H#¥A & CCEN H & COUNTV 
9 
O1FO ORG H#1FO 7 24* 2D 
O1FO 110 H#3 & CCEN H & PL S248@E 
9 
3 
O1FS ORG H#1FS 324* 64 
@1¥FS 116 H#3 & CCEN H & PL S2464E 
’ 
9 
O1FO ORG H#1F9 3 24¥*52 
@1FO 110 H#3 & CCEN H & PL S2432E 
’ 
J 
Q1FB ORG H#1FB 716*32 
O1FB 110 H#Z & CCEN H & PL S1632E 
’ 
’ 
Q1FT ORG H#1FD 3 16*16 
Z1FD I1@ H#S & CCEN H & PL S1616E 
’ 
; 
END 
OOOO XXXXGOISGXXXXXAXXX XXXXXXXX CG22 
O91 @1110110X@001011 XXXXXXXX @823 
OOG2 118011GOXGSG1XAX 9GSiGiii1 O24 
@O9@5 11191110X1G01XXX XXXXXXXX O25 
204 1100898011X1190819 89800190 @S2E 
@OZ5 110600011X1198G1F 98G20101 O27 
BOGE 11800911X1188010 20800110 OC2E 
QO2O7 11200011X1198218 09808111 GB29 
OYGE 11000011X11GGS12 CYOG19Ge OO2A 
G@O8S 118080081X1181008 99801181 OO2B 
OOZA 11990081X1181081 90810118 GO2C 
GOO@B 11280011X1191818 00801011 OO2I 
GOGC 1190G9811XXXG11XX GGGOGG11 OLO2E 
OCD G1@81C21X11GiXXX OYG10101 OO2F 
@SOE O111G118XAG11G11 AXAXAXXXXXX OO3@ 
OOGF 110011@OXXX11XXX 10810018 OOS1 
@G1@ 1100@0100X1111011 XXXXXXXX G032 
@G11 11800011X1111018 898010001 QB33 
@012 11901000X1111XAK AXXXXXXX 0054 
@@13 11001100X1111XXX OGG1G111 Q035 
@014 11061010X11111XX- XXXXXXXX OL3E 
0815 11001019X11011XX XXXXXXXX 0037 
@216 O@@XX101GX11G11XX AXAXXXXX 0038 
G17 @©11180110XO9@G1G11 XXXXXXXX OO3¢ 
@G1E 1100110S9X9OG1XXX GGG1G111 OSA 
@219 1112111@X1@@1XXX AXAXXXXX OO3B 
OZ1A 118008011X1198819 990110148 O23C 
21B 119600011X1180010 020011011 OO3D 
@@1C 11000011X1108218 02011109 QO3E 
@21D 11090011X1100819 99811101 OQ3F 
@G@1E 11802001X1181900 88102010 904 
@@1F 11800001X1101901 09810110 0641 
@G@2G 11090011X1121819 69198900 0242 
@221 110@0011XXXG11XX OGG11901 00435 


1198001 1XXX911XX 
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220186101 
XXXXXXXX 
11119012 
XXXXXXXX 
2100119 
XXXXXXXX 
0010111 
XXXXXXXX 
XXXXXXXX 
0012111 
XXXXXXXX 
09191191 
80101119 
@8110011 
88010112 
@0110¢2@1 
02181100 
8210181 
XXXXXXXX 
1001910 
XXXXXXXX 
0110111 
XXXXXXXX 
09210111 
XXXXXXXX 
XXXXXXXX 
00001111 
XXXXXXXX 
09111119 
00111111 
91000100 
2010118 
81899010 
90111191 
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AMDOS/29 AMDASM MICRO ASSEMBLER, Vi.i 
CRT CONTROLLER 


0044 
0845 
OB46 
0047 
0046 
G@B49 
OO4A 
0043 
@B4C 
@B4D 
OO4E 
HO4F 
BOE 
8051 
QO52 
@252 
GO54 
@O55 
0856 
O25" 
0858 
O25 
@Q5A 
Q@O5B 
@25C 
QO5D 
@O5E 
OO5¥F 
OOFL 
BOFS 
OBS 
Q2FB 
OO¥D 
Q100 
G11 
@182 
G12 
0104 
8195 
2106 
G10" 
8198 
@10S 
Z1@A 
G18 
G18C 
21D 
G10 
@10F 
8116 
@111 
@1ic 
91138 
0114 
@115 
@116 
9117 
9118 
g119 
G11A 
@118 
@11C 
€111 


810016001X1101XXxX 
@1110110XX811011 
11081109XXX11XXX 
110@8100X1111911 
11862011X1111012 
1100190@X1111XXX 
11601100X1111XxXX 
119001900X1111911 
11990011X1111819 
11081000X1111XXXxX 
110@1100X1111XxXX 
11001019X11111Xx 


01118110X0901011. 


1190110O0X@@G1XXX 
1119111@X1001XXX 
11080811X1100012 
11800001X11210¢0 
11000001X1101901 
11800011X1181010 
11000011XXXO11XX 
@1901001X1101XXxX 
@61110110XX@11011 
1120110OXXX11XXX 
11000108X1111011 
11900011X1111010 
11@01900X1111XXX 
11001100X1111Xxx 
11001010X11111XX 
XXXXOO1 TXXXXX1XX 
XXXXOO1 TXXXXX1LXAX 
XXXXOOG11XXXXX1XX 
XXXXOO11XXXXX1XX 
XXXXOOG@1L1IXXXAXX1XX 
011180118X00019611 
11021100XOOC1 XXX 
11121110X1901XXX 
11092211X1109010 
110@0011X118¢01¢e 
11200011X1109012 
11080011X1100012 
11060011X1190012 
11998001X1101000 
11000001X1101901 
11900011X11@1910 
1100001 1XXXO11XX 
1001001X1101XXX 
@1110110XX011011 
11091100XXX11XXxX 
11900109X11119811 
11000011X11110190 
119061900X1111XXxX 
11081190X1111XxxX 
11989100X1111011 
11060011X1111010 
11601000X1111XXX 
11001100X1111XXX 
1180918010X11111XX 
%1119110X9001911 
11091169XOGO1XXX 
11181110X1001 Xxx 
11000811X1108012 
11090011X1100812 
11098811X1108012 


$0818181 
XXXXXXXX 
11111919 
XXXXXXXX 
01801082 
XXXXXXXX 
82119000 
XXXXXXXX 
919911980 
XXXXXXXX 
@20001111 
XXXXXXXX 
XXXXXXXX 
001111 
XXXXXXXX 
81910011 
01911882 
9010118 
81919119 
1012012 
@9010161 
XXXXXXXX 
11601011 
XXXXXXXX 
619111080 
XXXXXXXX 
8091111 
XXXXXXXX 
COOSLOL1 
8619111 
88101210 
@@111011 
1810000 
XXXXXXXX 
998010111 
XXXXXXXX 
OOGOG011 
COLLC1 G2 
COPOC11 
@2000110 
0902111 
02001188 
2018110 
0001210 
BO0OCL10 
28010101 
XXXXXXXX 
119918080 
XXXXXXXX 
BGG1008O 
XXXXXXXX 
11101111 
XXXXXXXX 
08910108 
XXXXXXXX 
00010111 
XXXXXXXX 
XXXXXXIX 
90810111 
XXXXXXXX 
0011811 
20111008 
208111981 
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G11 
@11F 
G12 
@121 
G122 
Gizee 
$124 
8125 
@126 
G127 
6126 
G12¢ 
0124 
G12B 
G12C 
G12D 
G12E 
G1er 
G13 
6131 
@132 
133 
8134 
8135 
Z1ise 
9157 
8138 
8139 
G13A 
@13B 
@13C 
@135D 
@13E 
G13F 
G14 
G141 
@142 
@145 
0144 
@145 
146 
0147 
9148 
G149 
G14h 
G14F 
G14C 
@14D 
G14E 
@14F 
8150 
@151 
@152 
8152 
@154 
9155 
@156 
G15” 
@156 
@15¢ 
@15A 
@15B 
G15C 


11920811X1100010 
11900001X1191900 
11980081X1101021 
11988011X1101010 


11009011XXXG11xXX. 


@1081001X1181XXxX 
@1110110XX911811 
116011@0XXX11XXX 
11060102X1111811 
11@@@@11X1111012 
11001060X1111XXxX 
11981100X1111XXxX 
11080100X1111811 
11909011X11110192 
11891800X1111XXxX 
1100110@X1111XXX 
110019010X11111XX 
@1110119X0001011 
1109811@O0XOOG1XXX 
11181110X1001XXX 
11080011X1100012 
11002011X1190012 
11992001X118190@ 
11008001X11061901 
110299811X1181019 
11000011XXX911XX 
G1901001X1101XXX 
@1110119XX911211 
110011@0@XXX11XXX 
11908109X1111811 
110@9@11X1111¢01¢ 
11001000X1111XXxX 
11001100X1111XXX 
11001010X11111XX 
@1118116X9001011 
11081100XO9O01XXX 
1110111 9X1001XxXX 
118996011X11009019 
11802811X1190019 
119@0001X1181082 
11900001X1101901 
11090911X11916108 
1100801 1XXXG11xXX 
G1001081X11901XXX 
@1110110XX9811011 
119011@@XXX11 XXX 
118900100X1111811 
11099811X1111910 
119@@1002X1111XXX 
11081100X1111XXX 
1180¢@100X1111011 
11900011X1111912 
11901@00X1111XXX 
11901100X1111XXx 
11001010X11111XX 
@1110110X9001911 
1106119@XOOO1 XXX 
11101110X1001XXX 
118080011X1100012 
1100000 1X1101000 
119@0001X1121001 
11000911X11912810 
1100@0011XXXO11XX 


02011110 
28180011 
0201011 
SG1GCL1 
098110190 
0010101 
XXXXXXXX 
11991882 
XXXXXXXX 
90100111 
XXXXXXXX 
18100111 
XXXXXXXX 
81019011 
XXXXXXXX 
2010111 
XXXXXXXX 
XXXXXXXX 
@201¢111 
XXXXX XXX 
62110010 
@@110¢11 
8111900 
00018118 
00112119 
99110001 
G0810101 
XXXXXXXX 
11190020 
XXXXXXXX 
88111100 
XXXXXXXX 
8818111 
XXXXXXXX 
XXXXXXXX 
0001111 
XXXXXXXX 
81009011 
819001980 
01001001 
88818119 
1000111 
01000018 
08991801081 
XXXXXXXX 
1111101@ 
XXXXXXXX 
01001101 
XXXXXXXX 
11911111 
XXXXXXXX 
1010001 
XXXXXXXX 
@2001111 
XXXXXAXXX 
XXXXAXXXX 
08001111 
XXXXXXXX 
01911802 
@1611101 
22012110 
61011011 
®1010111 


AMIOS/2S AMDASM MICRO ASSEMBLER, V1.1 
CRT CONTROLLER 


g15D 
@15E 
@15F 
0168 
@161 
8162 
G1E3 
@164 
8165 
0166 
S1EV 
@168 
ZiFe 
Q1FS 
61FS 
Q1iFS 
G1 FD 


ENTRY 


Z10010@1X1101XXx 
01110110XX811011 
11881100XXX11 XXX 
11906100X1111911 
11980811X1111010 
119@1000X1111XxXX 
11081100X1111XXX 
110090188X11110811 
11080011X1111612 
119861000X1111XxXxX 
110801100X1111XXX 
11081010X11111XxX 
XXXXOO1LIXXXXX1XX 
XXXXOO11XAXXXX1XX 
XXXXOO11XXXXX1XAX 
XXXXQOO11XXXXX1XX 
XXXXOO1IXXXXXIXX 


POINTS 


SYMBOLS 


GOBACK 615 
H 


L 


LASTA 
MIE1E 


O21 
2OO8 
ZO1E 
@O52 


MIEC16E @157 


M1ESe2 


ZOSD 


MIES2E @142 


M2432 


QB2C 


MZ452E 2131 


M2464 


@B1¢ 


M2464E G11A 


M245¢ 


WCBS 


M2480E 2122 


S1E1€ 


GO50 


S1€16E Z155 


S1€3S2 


OSE 


S1622E 2142 


52432 


QQZA 


S2452E @12F 


SZ4E4 


2017 


SZ464E @11€ 


S246 


ZOO1 


S2460E 0102 


T1E16 


2058 


TIE1EL @15D 


T1622 


O44 


TIESCE 2149 


T2432 


QB535 


T24S2E 0138 


T2464 


OPEL 


T2L4E4E 9123 


T248E 


QOD 


TL4ECE G1i2C 


TCTAL 


PHASE 2 ERRORS = 


62018161 
XXXXXXXX 
11991980 
XXXXXXXX 
81190991 
XXXXXXAXX 
81111901 
XXXXXXXX 
81190121 
XXXXXXXX 
08091111 
XXXXXXXX 
SPLSGLVHL 
09211800 
8181111 
D1 SL00C0 
18101981 


My 
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APPENDIX D 


The Microprogrammed CRT Controiler was built on a System 29 
universal card and exercised by the System 29 support proces- 
sor. An Am9080A program was written to fill the character mem- 
ory. Figure D1 Is the listing of this program. In order to observe the 


correct output of the controller, an oscilloscope or CRT monitor 
can be connected through an adaptation circuit shown in Figure 
D2. 


3 

sFPROGRAMM TO WRITE INTO 
3BY MOSHE M. SHAVIT 
sREY 0 3/6/78 


CHARACTER MEMORY 


9 
OLFF = STACK EQU 1FFH SSTACK POINTER 
OOFF = FAR EQU OFFH sFIRST ACTRESS REGISTER O/F FORT 
8000 = CHARALD EQU 8000H  sCHARACTER MEMORY STARTS HERE 
0200 : ORG STACK+1 ;WORKING SPACE ABOVE STACK 
0200 EA ns 1 SFIRST ALIIKESS 
02014 CURA nS 2 SCURRENT ALITIRRESS 
0203 FIL 1S 2 FACFIRST CHARACTER IN L [NED 
¥ 
0100 ORG 100H SPROGRAM STARTS HERE 
0100 B1FFOL LXT SF STACK 
0103 213087 LXI H,730H+CHARAD  ;LAST LINE, FIRST CHARACTER 
0106 220302 SHLIN FIL SIN "FIRST CHARACTER IN LINE" BU: FER 
0109 220102 SHLU CURAD  3AND IN CURRENT ALIIRESS BUFFER 
OL0C AF XRA A sCLEAR A 
O10l DEF OUT FAR SSTART ALLIRESS=0 
OLOF 320002 STA FA SSAVE IN BUFFER 
O112 COLBOL CALL CLEAR  $CLEAR ALL CHAR. MEMORY 
O115 CHeC01 MAIN CALL CHARIN $REAI CHARACTER AND FUT IN CHAR. MEMORY 
0118 CBi5014 IMP MAIN st10 IT AGAIN 
bd 
y 
O11B 0600 CLEAR = -MVI B,0 sLATA=0 
O11D 210080 LXI H, CHARAL! sFIRST CHARACTER ADDRESS, 
120 110008 LXI 1,20480 ;COUNTER 
0123 70 CLEAR1 MOV ME SCLEAR THAT ADDRESS 
0124 1B TCX t = COUNT 
0125 23 INX H SNEXT ALDIRESS 
0126 7A MOY A,t s CHECK 
0127 BS ORA e : TF DONE 
0128 C22301 JNZ CLEARL 3NO. CONTINUE 
O12H CO RET sYES. BACK TO CALLER 
bY 
¥ 
0120 OFO4 CHARIN MVI Cyi SCP/M READ COLE 
OLZE CLOS0O CALL 5 SCF/M REAL ROUTINE 
0131 FELA CFI 1AH sCTL-Z? 
0133 CA0000 JZ 0 sRETURN TO CPM IF YES 
0136 2A0102 LHLE CURAD  3FETCH CURRENT ADDRESS 
0139 FEOD CPT OnH :CR? 
O13B CA4401 JZ CRLF sYES. 
O1ZE 77 MOV MA sWRITE CHARACTER 
O13F 23 INX H + INCREMENT 
0140 220102 SHLI CURAD  ;STORE IN BUFFER 
0143 C9 RET ;BACK TO CALLER 
y 
0144 ES CRLE PUSH H 
0145 05 PUSH I 
0146 CS FUSH 
O147 FS PUSH FSW 
0148 1E0A MUI E,0AH 
014A O£02 MVI C,2 
014C CLOSOO CALL 5 
O14F F1 POF FSW 
0150 C1 FOP i 
O151 D1 FOF tI 
0152 E41 FOP H sROUTINE TO ECHO LF 
0153 EB XCHG ;SAVE CURRENT ADDRESS IN DE 


Figure D1 


89 


0154 015000 LXI B,800 380 CHARACTERS/LINE 
0157 2A0302 LHLD FIL sFETCH FIRST CH. IN LENE ADDRESS 
O15A 09 DAL B sHL= ACNEXT LINE’S FIRST CH. ADD.) 
O15 EB XCHG SHL=CURRENT ADDR. ,DIE=ACNEXT LING ELKST CH. ALDRD 
O15C 0600 MVI B,O sLATA=0 
O15E 7C CRLF2 MoV A,H sMORE SIGNIFICANT CURRENT ADDRESS 
O15F BA CMF a s=NEXT LINE FIRST ADDRESS? 
0160 C26801 -INZ CRLF3  :NO 
0163 70 MOV A,L sLESS SIGNIFICANT CURRENT ADDRESS 
0164 BB CMF E 31S CURRENT LINE FULL? 
0165 CA6DO1 JZ CRLF4  sYES 
0168 70 CRLF3 MoV M,B sSTORE 9 AT THAT ADDRESS 
0169 23 INX H s;INCREMENT ADDRESS 
016A C35E01 JM’ CRLF2  +;GO CHECK AGAIN . 
O16D 7C CRLF4 MOV A,H ;MORE SIGNIFICANT FART OF ADDRESS 
O16E E607 ANI ? sONLY 3 LESS SIGNIFICANT BITS 
0170 FEO7 CPI 7 sLAST LINE PASSED? 
0172 C27E01 JINZ CRLFS  s3NOT YET 
0175 72 MOV A,L 3LESS SIGNIFICANT BYTE OF ADDRESS 
0176 FESO CPI 80H SARE WE AT 780H=19201I7 
0178 C27E01 JINZ CRLFS  3;NOT YET, SKIP 
O17EF 210080 LXI H, CHARAL sYES, START WRITING AT BEGINNING OF CH. MEM. 
O1L7E 220302 CRLFS SHLD FIL ;STORE IN FIRST CH. IN LINE BUFFER 
0181 220102 SHLD CURAD s3AND IN CURRENT ALIIRESS BUFFER 
0184 3A0002 LOA FA sFETCH FIRST VISIIBLE CHARACTER ADDRESS 
0187 C605 ALT 5 sSCROLL. 
0189 FE78 CPI 1200 ;TOO MUCH? 
O18 CC9401 CZ CRLFO  ;YES 
O18E 320002 STA FA ;STORE IN FIRST ADDRESS BUFFER 
0191 D3FF QUT FAR sLOAD REGISTER 
0193 C9 RET sRETURN TO CALLER 
+ | 
y 
0194 AF CRLFO XRA A ;FIRST ALMIRESS=0 
0195 C9 RET 


Figure D1 (Cont.) 


Oscilloscope Connections. Ball Monitor Interface. 
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Chapter Ill 
The Data Path 


INTRODUCTION 


The heart of most digital arithmetic processors is the arithmetic 
logic unit (ALU). The ALU can be thought of as a digital subsys- 
tem that performs various arithmetic and logic operations on two 
digital input variables. The Am2901A and Am2903 are Low 
Power Schottky TTL arithmetic logic unit/function generators that 
perform arithmetic/logic operations on two four-bit input vari- 
ables. In most ALUs, speed is generally a key ingredient. There- 
fore, as much parallelism in the operation of the arithmetic logic 
unit as possible is desired. 


The Am2901A and Am2903 ALUs are designed to operate with 
an Am2902A carry lookahead generator to perform multi-level full 
carry lookahead over any number of bits. Therefore, the devices 
have both the carry generate and carry propagate outputs re- 
quired by the Am2902A carry lookahead generator. The devices 
also have the carry output (C,,,4) and a two’s complement over- 
flow detection signal (OVR) available at the output. The net result 
is that a very high-speed 16-bit arithmetic logic unit/function 
generator can be designed and assembled using four of these bit 
slice devices and one Am2902A (the Am2902A is a high-speed 
version of the ‘182 carry lookahead generator). In addition, the 
Am2901A and Am2903 provide a minimum of 16 working regis- 
ters for providing source operands to the ALU. 


UNDERSTANDING THE BASIC FULL ADDER 


The results of an arithmetic operation in any position in a word 
depends not only on the two-input operand bits at that position, 
but also on all the lesser significant operand bits of the two input 
variables. The final result for any bit, therefore, is not available 
until the carries of all the previous bits have rippled through the 
logic array starting from the least significant bit and propagating 
through to the most significant bit. A full adder is a device that 
accepts two individual operand bits at the same binary weight, 
and also accepts a carry input bit from the next lesser significant 
weight full adder. The full adder then produces the sum bit for this 
bit position and also produces a carry bit to be used in the next 
more significant weight full adder carry input. The truth table for a 
full adder is shown in Figure 1. From this truth table, the equations 
for the full adder: 


S=A@®B@C 
Co = AB + BC + AC, 


where A and B are the input operands to the full adder and C 
is the carry input into the adder. 





> 
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1 
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Figure 1. Full Adder Truth Table. 


The sum output, S, represents the sum of the A and B operand 
inputs and the carry input. The carry output, Co, represents the 
carry out of this cell and can be used in the next more significant 
cell of the adder. Full adder cells can be cascaded as depicted in 
Figure 2 to form a four-bit ripple carry parallel adder. 


Note that once we have cascaded devices as shown in Figure 2, 
we may wish to discuss the equations for the i-th bit of the adder. 
In so doing, we might describe the equations of the full adder as 
follows: 


Sj = Ai® Bi @ C; 

Ci44 = A;B; 2 B,C; of A;C; 

where the A; and B; are the input operands at the i-th bit, and 
the C; is the carry input to the i-th bit. (Note that the equa- 
tions for this adder are iterative in nature and each depends 
on the result of the previous lesser significant bits of the 
adder array.) 


The connection scheme shown in Figure 2 requires a ripple 
propagation time through each full adder cell. If a 16-bit adder is to 
be assembled, the carry will have to propagate through all 16 full 
adder cells. What is desired is some technique for anticipating the 
carry such that we will not have to wait for a ripple carry to 
propagate through the entire network. By using some additional 
logic, such an adder array can be constructed. This type of adder 
is usually called a carry lookahead adder. 
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Figure 2. Cascaded Full Adder Cells Connected as a Four-Bit Ripple-Carry Full Adder. 
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A FOUR-BIT CARRY LOOKAHEAD ADDER 


Looking back to the equations developed for i-th bit of an adder, 
let us now rewrite the carry equation in a slightly different form. 
When we factor the C; in this equation, the new equation be- 
comes: 


Ci44 = AjB; + Cj(Aj + Bj) 


From the above equation, let us now define two additional equa- 
tions. These are: 


G; = A;B; 

P; = Aj + B; 
With these two new auxiliary equations, we can now rewrite the 
carry equation for the i-th bit as follows: 


Ci+4 = Gi t+ PIC; 


Note that we have now developed two terms: the P; term is 
known as carry propagate and the G; term is known as carry 
generate. An anticipated carry can be generated at any stage of 
the adder by implementing the above equations and using the 
auxiliary functions P; and G; as required. 


It is interesting to note that the sum equation can also be 
written in terms of these two auxiliary equations, P; and G;. For 
this case, the equation is: 
Si = (Aj + Bi)(AjBi) © C; 

The auxiliary function G; is called carry generate, because if it is 
true, then a carry is immediately produced for the next adder 
stage. The function P; is called carry propagate because it implies 
there will be a carry into the next stage of the adder if there is a 
Carry into this stage of the adder. That is, G;, causes a carry signal 
at the i-th stage of the adder to be generated and presented to the 
next stage of the adder while P; causes an existing carry at the 
input to the i-th stage of the adder to propagate to the next stage of 
the adder. 


Let us now write all of the sum and carry equations required for a 
full four-bit lookahead carry adder. 


So = Ao ® Bo® Co 

S; = Ai ® By @ (Go + PoCo) 

So = Ao ® Bo® (G, + P,Go P,P Co) 

$3 = A3@ Bs @ (Go + P2G, + PsP,Go a P2P4P9Co) 
Ci+4 = G3 + P3Go + P3P2G, + P3P5P,Go + P3P2P,P 9Co 


An important point to note is that ALL of the sum equations and 
the final carry output equation, C;44, can be written in terms of the 
A;, B;, and Cop inputs to the four-bit adder. The configuration as 
described above is shown in Figure 3. This figure is divided into 
two parts — the upper blocks show the auxiliary function 
generator circuitry required to implement the P; and G; equations 
while the lower block implements the logic required to generate 
the sum output at each bit position. 


A serious drawback to the lookahead carry adder is that as the 
word length is increased, the carry functions become more and 
more complex, eventually becoming impractical due to the large 
number of interconnections and heavy loading of the G; and P; 
functions. The auxiliary function concept can be extended, how- 
ever, by dividing the word length into fairly small increments and 
defining blocks of auxiliary functions G and P. 


It is possible for a given block to define a function G as the carry 
out generated with the block; and P can be defined as the carry 
propagate over the block. If the block size is set at four bits, then 
the functions for G and P for this block can be defined as follows: 


G G3 “ P3Go + P3P5G, + P3P5P,Gp 
P ae P3P5P;P5 
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Figure 3. Full Four-Bit Carry-Lookahead Adder. 


it is important to note that neither of these terms involves a 
carry-in (Co) to the block, so no matter how many blocks are tied 
in an adder, all the blocks have stable G and P functions available 
in a minimum number of gate delays. 


The G and P functions can be gated to produce a carry-in to each 
four-bit block, as a function of the lesser significant blocks. The 
Carry-in to a block is therefore: 


Cy = Gp—-1 + Pp—-1Gn—2 + Pn—1Pp—2Gn-3+ ..- 
+ Pr—1Pn—2Pn—3 hai PoP, PoCo 


Finally, the carry-in to each of the bits in a four-bit block must 
include a term for the actual least significant carry-in; note, 
therefore, that the equations for the four-bit full adder presented 
above include a term for carry-in at each bit position. 


Figure 4 shows the technique for cascading typical bit slice ALUs 
such as the Am2901A or Am2903 and one Am2902A in a full 
16-bit high-speed carry lookahead connection. Figure 5 shows a 
connection scheme using only four bit slices in a 16-bit arithmetic 
logic unit connection where the carries are rippled between the 
devices. Each bit slice does use internal carry lookahead over the 
four-bit block. 
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Figure 4. Full Lookahead Carry 16-Bit Adder. 
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Figure 5. Connection of 16-Bit ALU Using Ripple Carry. 


In summary, the ripple carry method can be used in conjunction 
with the lookahead technique in several ways. 


1. Lookahead carry over sections of the adder and ripple carry 
between these sections of the adder can be used. This 
method is often the most efficient in terms of hardware for a 
given speed requirement. It does not require the use of a 
lookahead carry generator such as the Am2902A. 

2. Lookahead carry across 16-bit blocks with a ripple carry be- 
tween 16-bit blocks can be used. This technique is usually 
called two-level carry lookahead addition. This technique re- 
sults in very high-speed arithmetic function generation and 
makes a reasonable tradeoff between the speed and 
hardware for word lengths greater than 16 bits. 

3. Full lookahead carry across all levels and all block sizes can 
be used. This is the highest speed arithmetic logic unit con- 
nection scheme. For word sizes up to 64 bits, itis referred to as 
three-level lookahead carry addition. Such a 64-bit ALU re- 
quires the use of five Am2902A carry lookahead generator 
units in addition to the 16 bit slice ALU devices as shown in 
Figure 6. 


OVERFLOW 


When two’s complement numbers are added or subtracted, the 
result must lie within the range of the numbers that can be 
handled by the operand word length. Numbers are normally 
represented either as fractions with a binary point between the 
sign bit and the rest of the word, or as integers where the binary 
point is after the least significant bit. The actual choice for the 
location of the binary point is really up to the design engineer, as 


the hardware configuration required for either technique is identi- 
cal. It is also possible to use number notations that include both 
integer and fractional representations in the same numbering 
scheme. Overflow is defined as the situation in which the result of 
an arithmetic operation lies outside of the number range that can 
be represented by the number of bits in the word. For example, if 
two eight-bit numbers are added and the result does not lie within 
the number range that can be represented by an eight-bit word, 
we Say that an overflow has occurred. This can happen at either 
the positive end of the number range or at the negative end of the 
number range. The logic function that indicates that the result of 
an operation is outside of the representable number range is: 


OVR = C.@ Ces 


where C, is the carry-in to the sign bit and C,,, is the 
Carry-out of the sign bit. 


Thus, for a four-bit ALU with the sign bit in the most significant bit 
position, the two’s complement overflow can be defined as the 
Cn+4 term exclusive OR’ed with the C,,43 term. 


Putting the ALU in the Data Path of a Simple Computer 


Once the Design Engineer understands the basic configuration 
and operation of a simple high speed carry lookahead adder, he 
can begin to understand the configuration required to implement 
the data handling section of a typical computing machine. The 
simplest architecture for the data handling path of a minicomputer 
is shown in Figure 7. Here, an accumulator is used in conjunction 
with an ALU to perform a basic arithmetic/storage capability for 
data handling. The computer control unit of Figure 7 can be a 
simple or sophisticated state machine as described in Chapter 2. 
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Figure 6. 64-Bit ALU with Full Carry Lookahead Using 5 Am2902s and 16 4-Bit Slices. 


COMPUTER 
CONTROL 
UNIT 


DATA-IN DATA-OUT OP CODE 


MPR-526 





Figure 7. Basic Computer Data Path. 


While the introductory material of this chapter concentrated on full 
adders, it should be understood that more ALU functions than 
addition are required if we are in to implement the data path of a 
typical minicomputer. Typically, some or all of the functions 
shown in Figure 8 are needed if we are to implement a powerful 
data handling capability. 


The operation of the ALU/accumulator configuration shown in 
Figure 7 can be described as follows. The accumulator can be 
loaded by bringing data in from the data-in port through the A 
input of the ALU, passed through the ALU and loaded into the 
accumulator. A second word of data can be presented at the 
data-in port to the A input of the ALU and the ALU can be used to 
perform an operation such as A + B, AOR B, AANDB, A — B and 
so forth. The results of this ALU operation can then be placed into 
the accumulator. The accumulator output is available at the 
data-out port for use elsewhere. Additional ALU functions such as 


A + B + CARRY 

A —-B-— 1+ CARRY 
AV B 

AAB 

AVY B 

B + CARRY 

ZERO 

PASS A 


Figure 8. Basic ALU Instructions. 


those shown in Figure 8 are easily implemented by adding some 
additional circuitry to the four-bit carry look ahead adder shown in 
Figure 3. If this circuitry is added, we will arrive at a logic diagram 
as shown in Figure 9. This diagram certainly is familiar to most 
CPU designers and is the well known Am74S181 four-bit arith- 
metic logic unit/function generator. 


Once the operation of the simple computer data path as shown in 
Figure 7 is understood, the Design Engineer will soon recognize 
the need for additional registers if our machine is to be general 
purpose and execute instructions. Very rapidly the need arises for 
a register to hold a program counter (PC) and a memory address 
register (MAR). The purpose of the program counter is to point to 
the address of the next instruction in main memory. Typically it is 
loaded into the memory address register which actually provides 
the address on to the address bus of the machine. Then, the 
program counter is incremented through the ALU and stored until 
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Figure 9. Logic Diagram for Am25LS181. 
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Figure 10.Three Register Computer Data Path. 


it is needed again. The block diagram of Figure 10 shows these 
additional registers connected in parallel at the output of the ALU. 
This ALU output is called the F bus. Each of these registers (the 
accumulator, the PC, and the MAR) has an enable input from the 
CCU so that they can selectively be loaded with data from the 
ALU. In addition, each of these registers has an output enable 
such that they can be selectively enabled onto the D bus. The D 
bus represents the data output path from the basic computer data 


path and also is used as one of the inputs to the actual ALU/func- 
tion generator. The other input in this example is called the R bus 
and comes directly from the main memory data output as well as 
from the I/O data input. As shown in Figure 10, the memory 
address register (MAR) has a second output that is used to drive 
the address bus. In this example, this register always contains the 
address to be applied to the external memory whether it be the 
address of data or the address of an instruction. 
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The best way to understand the operation of this single ALU/three 
register machine is to take an example. Let us assume we have 
just completed the execution of one machine instruction and are 
ready to fetch the next instruction. The first operation would be to 
transfer the current value of the program counter onto the D bus 
through the ALU onto the F bus and into the memory address 
register. This might be accomplished during one microcycle. The 
second operation might be to again put the PC on the D bus, pass 
itthrough the ALU B port and increment the value at the B port and 
reload it into the PC register. Thus, the PC has again been 
updated to point to the address of the next intruction. During this 
time, the address from the MAR is on the address bus and we are 
fetching data from the external memory and placing it on the R 
bus. The third microcycle would be to bring the data out of the 
external memory and pass it to the instruction register in the CCU. 
The next microcycle might be to decode this instruction and 
determine that the next word after the current instruction in mem- 
ory (an immediate operation) is to be added to the value currently 
in the accumulator. Thus, we would again need to place the PC 
into the MAR on one cycle and then increment the PC on the next 
cycle. Following this, the data from the external memory could be 
brought to the R bus through the A port of the ALU and added to 
the accumulator value which is placed on the D bus and brought 
through the B port of the ALU. The result would be placed in the 
accumulator. This operation would complete the example and we 
would be ready to fetch the next instruction. As can be seen, a 
number of microcycles are required to fetch the instruction, de- 
code it, fetch the data and execute the instruction. One of the best 
ways to understand the flow needed to implement a typical in- 
struction set is shown in Figure 11. Here, we see the basic 
instruction fetch and decode operation followed by the path used 
to execute each of the various instructions. Then, we see a return 
to the fetch operation to fetch the next instruction. 


Certainly from this discussion we can see how three registers 
have enhanced the performance of the simple ALU/accumulator 
data path shown in Figure 7. Typically, even more registers than 
shown in Figure 10 are needed if we are to increase the power of 
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Figure 11. Steps for ADD Instruction. 





our machine. If we examine the block diagram of Figure 12, we 
see a similar architecture to that as shown in Figure 10. Here, the 
number of working registers has been expanded to sixteen at the 
output of the ALU. These can be used to provide a program 
counter function and a number of accumulator functions simul- 
taneously. In addition, note that the registers have two output 
ports such that the simultaneous selection of any two of the 
sixteen registers is possible. Both of these registers can be pre- 
sented to the ALU so that operations on two registers simultane- 
ously can be executed. In addition, a data input multiplexer is 
available at the A port of the ALU such that external data can be 
brought in to the configuration. Likewise, there is an output multi- 
plexer such that either the A output of the registers or the ALU 
output can be selected. This output multiplexer is used to provide 
a data out port and the output can also be loaded into memory 
address register to provide an address as required. Thus, the 
architecture of Figure 12 is quite similar to that of Figure 10 except 
that the number of registers has been increased to provide addi- 
tional flexibility. 


lf we assume that one of the sixteen registers inside of this 
register file is to be used as the program counter, we see that the 
program counter can be brought out of the A output port and 
loaded into the memory address register and at the same time it 
can also be brought out the B output port and incremented in ALU 
and reloaded into the register file. In this architecture it appears 
the A output of the register stack can also be brought to the input 
multiplexer and the A port of the ALU and incremented via that 
path and reloaded into the registers. While this is possible in the 
architecture of Figure 12, we are leading up to the implementation 
of an Am2901A and this path is not needed in the Am2901A. 
Thus, we can implement functions and operations in the diagram 
of Figure 12 just as we could in the diagram of Figure 10. How- 
ever, what was previously performed in two microcycles can now 
be performed in one microcycle. That is, the MAR can be loaded 
with the current value of the PC and at the same time the PC can 
be incremented and the new value restored in the PC register. 
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Figure 12. Multi-Register ALU. 
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Another feature of the block diagram of Figure 12 is the depiction 
of the carry in bit to the ALU and the four output flags associated 
with the ALU. Here, carry in is the normal carry in as needed in 
any adder such that the device is cascadable. In addition, certain 
kinds of arithmetic functions such two’s complement arithmetic 
also need the ability to provide a carry in for certain operations. 
The most common is two’s complement subtract which is usually 
performed by complementing the operand to be subtracted, ad- 
ding and adding one at the carry in. Also, the ALU shows the four 
output flags usually associated with a typical minicomputer. 
These are the carry output, the sign bit, the overflow detect, and 
the zero detect. These four status flags are used to determine 
various things about the operation being performed. The carry out 
flag and overflow flag are as described in the previous sections of 
this chapter. They provide the carry and overflow information 
about the addition. 


The sign bit is simply the most significant bit of the ALU and 
represents the sign of a two’s complement number. That is, when 
the sign bit is LOW, we assume the two’s complement number is 
positive and when the sign bit is HIGH, we assume the two’s 
complement number is negative. Thus, the sign bit is active HIGH 
and carries negative weight as we assume in any standard two’s 
complement number representation. If the reader is unfamiliar 
with two’s complement number notations, a discussion of this 
topic can be found in an application note entitled “The Am25S05, 
Am2505 and Am25L05 Schottky, Standard and Low Power TTL 
Two’s Complement Digital Multipliers” as found in Advanced 
Micro Devices’ Schottky and Low Power Schottky Data Book 
dated 10/77. This application note begins on page 5-49 and fully 
details two’s complement number notation and gives examples. 


The fourth status flag is called the zero flag and again is just what 
the name implies. This flag represents the fact that all of the ALU 
outputs are at logic Zero. In this design, a logic zero means that all 
of the ALU output bits are LOW. 


If the architecture of Figure 12 is extended a little more, we will 
arrive at the Am2901A as depicted in Figure 13. Here, we have 
redrawn the structure so that the registers are placed above the 
ALU; however, the function is identical. Two new functions have 
been added to this block diagram that have not previously been 
discussed. These are the RAM shift matrix located directly above 
the sixteen registers now described as a 16 x 4 dual port RAM. 
The purpose of the RAM shift network is to allow the ability of 
shifting the data word to be written into the register either up one 
bit position or down one bit position. The second function added 
to the block diagram is that of the Q register and shift network. 
Here, the Q register is used as an auxiliary register such that 
double length operations can be performed and it is also used in 
the multiply and divide algorithms. In addition, the shift network 
allows the Q register contents to be shifted up one bit position or 
shifted down one bit position. In addition, it should be pointed out 
that the memory address register is not part of the Am2901A. This 
is because there were not enough pins on the package to imple- 
ment the function and the additional power required by the output 
buffers would have reduced the performance of the ALU and 
register stack. Instead, this function is being designed into other 
2900 family products. 


Am2901A ARCHITECTURE 


A detailed block diagram of the Am2901A bipolar micropro- 
grammable microprocessor structure is shown in Figure 14. The 
circuit is a four-bit slice cascadable to any number of bits. There- 
fore, all data paths within the circuit are four bits wide. The two key 
elements in the Figure 14 block diagram are the 16-word by 4-bit 
2-port RAM and the high-speed ALU. 
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Figure 13. Am2901A Block Diagram. 


Data in any of the 16 words of the Random Access Memory 
(RAM) can be read from the A-port of the RAM as controlled by 
the 4-bit A address field input. Likewise, data in any of the 16 
words of the RAM as defined by the B address field input can be 
simultaneously read from the B-port of the RAM. The same code 
can be applied to the A select field and B select field in which case 
the identical file data will appear at both the RAM A-port and 
B-port outputs simultaneously. 


When enabled by the RAM write enable (RAM EN), new data is 
always written into the file (word) defined by the B address field of 
the RAM. The RAM data input field is driven by a 3-input multi- 
plexer. This configuration is used to shift the ALU output data (F) if 
desired. This three-input multiplexer scheme allows the data to 
be shifted up one bit position, shifted down one bit position, or not 
shifted in either direction. 


The RAM A-port data outputs and RAM B-port data outputs drive 
separate 4-bit latches. These latches hold the RAM data while the 
clock input is LOW. This eliminates any possible race conditions 
that could occur while new data is being written into the RAM. 


The high-speed Arithmetic Logic Unit (ALU) can perform three 
binary arithmetic and five logic operations on the two 4-bit input 
words R and S. The R input field is driven from a 2-input multi- 
plexer, while the S input field is driven from a 3-input multiplexer. 
Both multiplexers also have an inhibit capability; that is, no data is 
passed. This is equivalent to a “zero” source operand. 


Referring to Figure 14, the ALU R-input multiplexer has the RAM 
A-port and the direct data inpDuts (D) connected as inputs. 
Likewise, the ALU S-input multiplexer has the RAM A-port, the 
RAM B-port and the Q register connected as inputs. 
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This multiplexer scheme gives the capability of selecting various 
pairs of the A, B, D, Q and “0” inputs as source operands to the 
ALU. These five inputs, when taken two at a time, result in ten 
possible combinations of source operand pairs. These combina- 
tions include AB, AD, AQ, AO, BD, BQ, BO, DQ, DO and QO. It is 
apparent that AD, AQ and AO are somewhat redundant with BD, 
BQ and BO in that if the A address and B address are the same, 
the identical function results. Thus, there are only seven com- 
pletely non-redundant source operand pairs for the ALU. The 
Am2901A microprocessor implements eight of these pairs. The 
microinstruction inputs used to select the ALU source operands 
are the Ip, I, and lp inputs. 


The two source operands not fully described as yet are the D input 
and Q input. The D input is the four-bit wide direct data field input. 
This port is used to insert all data into the working registers inside 
the device. Likewise, this input can be used in the ALU to modify 
any of the internal data files. The Q register is a separate 4-bit file 
intended primarily for multiplication and division routines but it 
can also be used as an accumulator or holding register for some 
applications. 


The ALU itself is a high-speed arithmetic/logic operator capable 
of performing three binary arithmetic and five logic functions. The 
13, 14 and Is microinstruction inputs are used to select the ALU 
function. The definition of these functions is shown in Figure 15. 
The normal technique for cascading the ALU of several devices is 
in a look-ahead carry mode. Carry generate, G, and carry propa- 
gate, P, are outputs of the device for use with a carry-look- 


ahead-generator such as the Am2902A (’182). Acarry-out,C,44, ” 


is also generated and is available as an output for use as the carry 
flag in a status register. Both carry-in (C,,) and carry-out (C,44) 
are active HIGH. 
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Figure 15. Am2901A Microinstruction Controi. 


The ALU has three other status-oriented outputs. These are F3, F 
= 0, and overflow (OVR). The F3 output is the most significant 
(sign) bit of the ALU and can be used to determine positive or 
negative results without enabling the three-state data outputs. F3 
is non-inverted with respect to the sign bit output Y3. The F = 0 
output is used for zero detect. It is an open-collector output and 
can be wire OR’ed between microprocessor slices. F = 0 is HIGH 
when all F outputs are LOW. The overflow output (OVR) is used to 
flag arithmetic operations that exceed the available two’s com- 
plement number range. The overflow output (OVR) is HIGH when 
overflow exists; that is, when C,43 and C,.,4 are not the same 


polarity. 





The ALU data output is routed to several destinations. It can be a 
data output of the device and it can also be stored in the RAM or 
the Q register. Eight possible combinations of ALU destination 
functions are available as defined by the Ig, |, and lg micro- 
instruction inputs. These combinations are shown in Figure 15. 


The four-bit data output field (Y) features three-state outputs and 
can be directly bus organized. An output control (OE) is used to 
enable the three-state outputs. When OE is HIGH, the Y outputs 
are in the high-impedance state. 


A two-input multiplexer is also used at the data output such that 
either the A-port of the RAM or the ALU outputs (F) are selected at 
the device Y outputs. This selection is controlled by the lg, l7 and 
lg microinstruction inputs. 


As was discussed previously, the RAM inputs are driven from a 
three-input multiplexer. This allows the ALU outputs to be entered 
non-shifted, shifted up one position (X2) or shifted down one 
position (+2). The shifter has two ports; one is labeled RAMpy and 
the other is labeled RAM3. Both of these ports consist of a 
buffer-driver with a three-state output and an input to the multi- 
plexer. Thus, in the shift up mode, the RAMs; buffer is enabled and 
the RAMp multiplexer input is enabled. Likewise, in the shift down 
mode, the RAM) buffer and RAM, input are enabled. In the 
no-shift mode, both buffers are in the high-impedance state and 
the multiplexer inputs are not selected. This shifter is controlled 
from the Ig, lz and lg microinstruction inputs. 


Similarly, the Q register is driven from a 3-input multiplexer. In the 
no-shift mode, the multiplexer enters the ALU data into the Q 
register. In either the shift-up or shift-down mode, the multiplexer 
selects the Q register data appropriately shifted up or down. The 
Q shifter also has two ports; one is labeled Qy and the other is Q3. 
The operation of these two ports is similar to the RAM shifter and 
is also controlled from Ig, lz and Ig. 


The clock input to the Am2901A controls the RAM, the Q register, 
and the A and B data latches. When enabled, data is clocked into 
the Q register on the LOW-to-HIGH transition of the clock. When 
the clock input is HIGH, the A andB latches are open and will pass 
whatever data is present at the RAM outputs. When the clock 
input is LOW, the latches are closed and will retain the last data 
entered. If the RAM-EN is enabled, new data will be written into 
the RAM file (word) defined by the B address field when the clock 
input is LOW. 


Am2903 GENERAL DESCRIPTION 


The Am2903 is a four-bit expandable bipolar microprocessor 
slice that performs all functions performed by the industry stan- 
dard Am2901A. In addition, it provides a number of significant 
enhancements that are especially useful in arithmetic oriented 
processors. The Am2903 contains sixteen internal working re- 
gisters arranged in atwo address architecture and it also provides 
all of the necessary signals to expand the register file externally 
using the Am29705 register stack. Any number of registers can 
be cascaded to the Am2903 using this technique. In addition to its 
complete arithmetic and logic instruction set, the Am2903 pro- 
vides a special set of instructions which facilitate the implementa- 
tion of multiplication, division, normalization and other previously 
time consuming operations such as parity generation and sign 
extension. A block diagram of the Am2903 is shown in Figure 16. 


ARCHITECTURE OF THE Am2903 


The Am2903 is a high-performance, cascadable, four-bit bipolar 
microprocessor slice designed for use in CPU’s, peripheral con- 
trollers, microprogrammable machines, and numerous other ap- 
plications. The microinstruction flexibility of the Am2903 allows 
the efficient emulation of almost any digital computing machine. 


101 


102 


DATA IN 


A 8 
ADDRESS ADDRESS 


RAM WRITE 
ENABLE 


A B 
DATA OUT DATA OUT 


INSTRUCTION 
DECOD 


Figure 16. Basic Am2903 Block Diagram. 


The nine-bit microinstruction selects the ALU sources, function, 
and destination. The Am2903 is cascadable with full lookahead or 
ripple carry, has three-state outputs, and provides various ALU 
status flag outputs. Advanced Low-Power Schottky processing is 
used to fabricate this 48-pin LSI circuit. 


All data paths within the device are four bits wide. As shown in the 
block diagram of Figure 16, the device consists of a 16-word by 
4-bit, two-port RAM with latches on both output ports, a high-per- 


formance ALU and shifter, a multi-purpose Q Register with shifter 


input, and a nine-bit instruction decoder. 
Two-Port RAM 


Any two RAM words addressed at the A and B address ports can 
be read simultaneously at the respective RAM A and B output 
ports. Identical data appear at the two output ports when the 
same address is applied to both address ports. The latches at the 
RAM output ports are transparent when the clock input, CP, is 
HIGH and they hold the RAM output data when CP is LOW. Under 
control of the OE, three-state output enable, RAM data can be 
read directly at the Am2903 DB 1/O port. 





External data at the Am2903 Y I/O port can be written directly into 
the RAM, or ALU shifter output data can be enabled onto the Y I/O 
port and entered into the RAM. Data is written into the RAM at the 
B address when the write enable input, WE, is LOW and the clock 
input, CP, is LOW. 


Arithmetic Logic Unit 


The Am2903 high-performance ALU can perform seven arithme- 
tic and nine logic operations on two 4-bit operands. Multiplexers 
at the ALU inputs provide the capability to select various pairs of 
ALU source operands. The E, input selects either the DA exter- 
nal data input or RAM output port A for use as one ALU operand 
and the OEg and Ip inputs select RAM output port B, DB external 
data input, or the Q Register content for use as the second ALU 
operand. Also, during some ALU operations, zeros are forced at 
the ALU operand inputs. Thus, the Am2903 ALU can operate on 
data from two external sources, from an internal and external 
source, or from two internal sources. 


When instruction bits I4, I3, lo, ly and Io are LOW, the Am2903 
executes special functions. Figure 17 defines these special func- 
tions and the operation which the ALU performs for each. When 
the Am2903 executes instructions other than the nine special 
functions, the ALU operation is determined by instruction bits I, 
I3, lo and I,. Figure 18 defines the ALU operation as a function of 
these four instruction bits. 


Am2903s may be cascaded in either a ripple carry or lookahead 
carry fashion. When a number of Am2903s-are cascaded, each 
slice must be programmed to be a most significant slice (MSS), 
intermediate slice (IS), or least significant slice (LSS) of the array. 
The carry generate, G, and carry propagate, P, signals required 
for a lookahead carry scheme are generated by the Am2903 and 
are available as ontputs of the least significant and intermediate 
slices. 


The Am2903 also generates a carry-out signal, C,44, which is 
generally available as an output of each slice. Both the carry-in, 
C,, and carry-out, C,.4, signals are active HIGH. The ALU 
generates two other status outputs. These are negative, N, and 
overflow, OVR. The N output is generally the most significant 
(sign) bit of the ALU output and can be used to determine positive 
or negative results. The OVR output indicates that the arithmetic 
operation being performed exceeds the available two’s comple- 
ment number range. The N and OVR signals are available as 
outputs of the most significant slice. Thus, the multi-purpose G/N 
and P/OVR outputs indicate G and P at the least significant and 
intermediate slices, and sign and overflow at the most significant 
slice. To some extent, the meaning of the C,,,.4, P/OVR, and G/N 
signals vary with the ALU function being performed. 


ALU Shifter 


Under instruction control, the ALU shifter passes the ALU output 
(F) non-shifted, shifts it up one bit position (2F), or shifts it down 
one bit position (F/2). Both arithmetic and logical shift operations 
are possible. An arithmetic shift operation shifts data around the 
most significant (sign) bit position of the most significant slice, and 
a logical shift operation shifts data through this bit position (see 
Figure 19). SIOg and SIO3 are bidirectional serial shift inputs/out- 
puts. During a shift-up operation, SIO, is generally a serial shift 
input and SIO, a serial shift output. During a shift-down operation, 
S103 is generally a serial shift input and SIO9 a Serial shift output. 


The ALU shifter also provides the capability to sign extend at slice 
boundaries. Under instruction control, the SIO (sign) input can 
be extended through Yo, Y;, Y2, Y3 and propagated to the SIO; 
output. 
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Figure 17. Special Functions: |p = |, = lb = ly = |, = LOW, IEN = LOW. 
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Figure 18. ALU Functions. 


A cascadable, five-bit parity generator/checker is designed into 
the Am2903 ALU shifter and provides ALU error detection capa- 
bility. Parity for the Fo, F;, Fo, F3 ALU outputs and SIO3 input is 
generated and, under instruction control, is made available at the 
SIOpo output. 
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Figure 19. 


The instruction inputs determine the ALU shifter operation. Figure 
17 defines the special functions and the operation the ALU shifter 
performs for each. When the Am2903 executes instructions other 
than the nine special functions, the ALU shifter operation is de- 
termined by instruction bits Iglzigls. Figure 20 defines the ALU 
shifter operation as a function of these four bits. 


Q Register 


The Q Register is an auxiliary four-bit register which is clocked on 
the LOW-to-HIGH transition of the CP input. It is intended primar- 
ily for use in multiplication and division operations; however, itcan 
also be used as an accumulator or holding register for some 
applications. The ALU output, F, can be loaded into the Q Regis- 
ter, and/or the Q Register can be selected as the source for the 
ALU S operand. The shifter at the input to the Q Register provides 





en 
a 
Pon 
a 


Anth F/2-+Y 


| 5 | 
3 
ALU Shifter Most Sig. | Other | Most Sig. | Other 
Function Slice Slices Slice 
F3 


Most Sig | Other Shifter 








Log F/2-+Y 








Anth F/2-+Y 





Log F/2-+Y 


Function 
Fo 
Fo 
Fo 
Fo 














Parity 
Parity 
Parity 











Anth 2F-+Y 





log 2F-+Y 





Anth 2F--Y 








Log 2F+Y 


| Log 2Q-Q | 2Q-Q Q3 








L 
L 
L 
L 
L 
L 
L 
H 
H 
H 
H 


[tog 200 [05 | Tut | 





L 
L 
L 
L 
H 
H 
H 
H 
L 
L 
L 
L 
H 
H 
H 


Xprp er] oe) el ml ec] cp el eel zc] rye ee 
ba a” Oe Oe 8 ee 8 0 el De Od 


SIOg— Yo, Y1, Ya, Y3 





aieleteletalefel-felel=|-f4)-141 8 


i 


Parity = F3 ¥ Fo ¥ Fy ¥ Fo ¥ SIO3 
¥ = Exclusive OR 








Hz 


Hi-Z = High Impedance 


Figure 20a. ALU Destination Control for Ip or 1; or lp or Iz or I, = HIGH, IEN = LOW. 
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Figure 20b. Am2903 ALU Destination Control Summary. 


the capability to shift the Q Register contents up one bit position 
(2Q) or down one bit position (Q/2). Only logical shifts are per- 
formed. QIO 9 and QIO3 are bidirectional shift serial inputs/out- 
puts. During a Q Register shift-up operation, QIOp is a serial shift 
input and QIO3 is a serial shift output. During a shift-down opera- 
tion, QIO3 is a serial shift input and QIOp is a serial shift output. 


Double-length arithmetic and logical shifting capability is pro- 
vided by the Am2903. The double-length shift is performed by 
connecting QIO3 of the most significant slice to SIOg of the least 
significant slice, and executing an instruction which shifts both the 
ALU output and the Q Register. 


The Q Register and shifter operation is controlled by instruction 
bits Iglzlgls. Figures 17 and 20 define the Q Register and shifter 
operation as a function of these four bits. 


Output Buffers 


The DB and Y ports are bidirectional I/O ports driven by three- 
state output buffers with external output enable controls. The Y 
output buffers are enabled when the OEy input is LOW and are in 
the high-impedance state when OEy is HIGH. Likewise, the DB 
output buffers are enabled when the OEg input is LOW and in the 
high-impedance state when OEg is HIGH. 


The zero, Z, pin is an open collector input/output that can be 
wire-OR’ed between slices. As an output it can be used as a zero 
detect status flag and generally indicates that the Yo.3 pins are all 
LOW, whether they are driven from the Y output buffers orfrom an 
external source connected to the Yo.3 pins. To some extent the 
meaning of this signal varies with the instruction being performed. 


Instruction Decoder 


The Instruction Decoder generates required internal control sig- 
nals as a function of the nine Instruction inputs, lp.g; the Instruc- 
tion Enable input, IEN; the LSS input; and the WRITE/MSS in- 
put/output. The WRITE output is LOW when an instruction which 
writes data into the RAM is being executed. 


When IEN is LOW, the WRITE output is enabled and the Q 
Register and Sign Compare Flip-Flop can be written according to 
the Am2903 instruction. The Sign Compare Flip-Flop is an on- 
chip flip-flop which is used during an Am2903 divide operation. 





Programming the Am2903 Slice Position 


Tying the LSS input LOW programs the slice to operate as a least 
Significant slice (LSS) and enables the WRITE output signal onto 
the WRITE/MSS bidirectional I/O pin. When LSS is tied HIGH, the 
WRITE/MSS pin becomes an input pin; tying the WRITE/MSS pin 
HIGH programs the slice to operate as an intermediate slice (IS) 
and tying it LOW programs the slice to operate as a most signifi- 
cant slice (MSS). This is shown in Figure 21. 
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Figure 21. Am2903 — 16-Bit CPU with Carry Look Ahead. 


EXPANDING THE NUMBER OF Am2903 REGISTERS 


The Am2903 contains 16 internal working registers configured in 
a standard two port architecture. The number of working registers 
in the ALU configuration can be increased by utilizing the 
Am29705 16-word by 4-bit two-port RAM. Any number of 
Am29705’s can be connected to the Am2903 to increase the 
number of working registers. Figure 22 shows a block diagram of 
the basic Am29705. As is seen, the device consists of a 16 word 
by 4 bit two port RAM with latches at the A and B outputs similar to = a 
the RAM contained within the Am2903. Each of the latch outputs we 
has three state drivers capable of driving the DA and DB inputs of 

the Am2903. The Am29705 is a non-inverting device. That is, 
data presented at the inputs is stored in the RAM and when 
brought to the RAM outputs, it is non-inverted from when it was 
orginally brought into the device. 


DATA IN 


The technique for using the Am29705 to expand the number of 
registers in the Am2903 can best be visualized by referring to 
Figures 23 and 24 simultaneously. In Figure 23, the data bus 
connections are shown such that the Am2903 Y output is used to 
drive the Am29705 inputs. Here, we also assume this bus may be 
tied to a data bus through a bi-directional buffer. In Figure 23, the 
A outputs of the Am29705 are connected together and also 
connected to the DA input of the Am2903. Likewise, the B outputs 
from the Am29705 are also shown connected to the DB inputs of 
the Am2903. In all cases, we are assuming 16-bit data busses. 
Thus, four Am2903’s are assumed and eight Am29705’s are 
assumed. As shown in Figure 23, one of the write enable inputs to 
the Am29705 is tied to the latch enable input of the Am29705 and 
these pins are also tied to the clock input of the Am2903. This 
allows the latches in the Am29705 to perform identically to those 
in the Am2903. Figure 22. Am29705 Block Diagram. 
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If we refer to Figure 24, we see the connections required to set up 
the addressing for additional registers associated with the 
Am2903. Here, three two-line to four-line decoders are used to 
properly control the A address, B address and write enable sig- 
nals to the devices. As shown in Figure 24, the four A address 
lines are all tied in parallel between the Am2903 and the 
Am29705's. The two-line to four-line decoder is used to enable 
the appropriate output enable from the Am29705’s or switch the 
EA MUX inside the Am2903 such that the proper register is 
selected. The B address operates in a similar fashion in that the 
four B address lines are also all tied together. Likewise, a two-line 
to four-line decoder is used to properly select the output enable of 
either the Am29705’s or the Am2903 such that the correct source 
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Figure 23. Am2903 — Data Bus Cascading. 
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operand register is selected. In addition, a two-line to four-line 
decoder is used to control the write enable signal such that only 
One register is written into as a destination. This is controlled by 
properly selecting the write enable of either the Am2903 or the 
Am29705 as determined by the two most significant bits of the B 
address. 


If this technique is used properly, any number of Am29705’s can 
be used in conjunction with the Am2903. It may be necessary to 
use either a three-line to eight-line decoder or perhaps even a 
larger circuit to decode the more significant bits of the A and B 
addresses. Likewise, the write enable signal must be controlled 
so that the correct destination register will be written. 


UNDERSTANDING BIT SLICE TIMING 


Perhaps one of the most important aspects of designing with 
either the Am2901A or the Am2903 is understanding the calcula- 
tions required to compute the worst case AC performance. In 
order to perform these calculations, we have selected a number 
of standard Schottky devices and assigned minimum, typical and 
maximum speeds at 25°C and 5V for use in these calculations as 
shown in Figure 25. Certainly the design engineer should use the 
exact specifications of the devices he has selected for his design 
in order to perform the worst case calculations. What is intended 
here is an understanding of the technique to perform these calcu- 
lations and some method to allow a comparison of the Am2901A 
and Am2903 in terms of their AC performance. Since at the time 
of this writing the Am2903 is still being characterized, only the 
typical AC data is currently available. Thus, all calculations will be 
made using the typical AC times such that we can compare the 
Am2901A with the Am2903. When final characterization data on 
the Am2903 is available, the designer can then compute his 
performance by selecting the appropriate temperature range and 
power supply variations as required by his design. 


Figure 26 shows the typical AC calculations for the functions 
usually considered in an Am2901A design. These functions are 
usually the speed for a logic operation, arithmetic operation, logic 
operation with shift and arithmetic operation with shift. In each 
case, we are computing speeds from the LOW-to-HIGH transition 
of a clock through an entire microcycle to the next LOW-to-HIGH 
transition of a clock. 
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Figure 24. Am2903 — RAM Address Cascading. 
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Figure 25. Standard Device Schottky Speeds. 


Similarly, Figure 27 shows the same type of computations for an 
Am2903 system. There is one very important distinction that 
should be made in computing the timing of an Am2903 16-bit ALU 
when compared with an Am2901A ALU in that in the Am2903, the 
shifter is at the output of the ALU and is followed by the zero 
detector. Thus, in an Am2903 design, the flags are no longer 


independent of the shift operation. This is easily seen in Figure 
27. 


By way of comparison, Figure 28 shows speeds for the four types 
of operations for the Am2901A 16-bit system as compared with 
the Am2903 16-bit system. 
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Figure 28. Summary of Am2901A and Am2903 AC 
Performance in a 16-Bit Configuration. 


USING THE Am2903 IN A 16-BIT DESIGN 


Perhaps the best technique for understanding the design of the 
16-bit ALU is to simply take an example. Figure 29 shows a block 
diagram overview of four Am2903’s with the appropriate shift 
matrix control, status register, MAR and the usual interface to a 
CCU and main memory. This block diagram represents the nor- 
mal data handling path associated with a simple 16-bit minicom- 
puter. If we expand this block diagram to show what would nor- 
mally be considered to be the complete 16-bit central processing 
unit, the block diagram of Figure 30 results. Here, we see the 
Am2903’s surrounded by a typical set of MSI support chips. In 
addition, the block diagram shows a typical computer control unit 
as described in Chapter 2 of this series. Thus, all of the blocks are 
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now in place to show a simple 16-bit microcomputer built using 
the Am2900 family devices. The full design for such a machine is 
shown in Figure 31. 


Figures 31A, Figure 31B and Figure 31C detail the connection of 
each IC used in this design. Quite simply, the design can be 
described as follows. Figure 31A represents the microprogram 
sequencer portion of the design. U1, U2 and U3 are the instruc- 
tion register that receive a 16-bit instruction from main memory. 
U4, U5 and U6 are the mapping PROMs used to decode the OP 
code portion of the instruction to arrive at a starting address for 
the microprogram sequencer. The microprogram sequencer is 
the Am2910 and is shown as U7. The branch address pipeline 
register is U8, U9 and U10 and can be enabied to the D inputs of 
the Am2910 sequencer to provide the jump address from micro- 
code. The pipeline register for the instruction inputs to the 
Am2910 is U14. This machine also has the ability to select the A 
and B addresses for the Am2903 devices from the microprogram 
as well as the instruction register and U11 and U12 provide this 
capability as a part of the pipeline register. U13 is a two line to four 
line decoder used as part of the control for the A and B address 
select for the Am2903’s. U15 is part of the pipeline register and 
provides both true and complement outputs for bit 11. U16 and 
U17 represent a one of sixteen decoder whose output can be 
applied to the DA bus to allow the implementation of all the bit 
operations. These include bit set, bit clear, bit toggle and bit test. 
U18 and U19 are PROM's that provide the ability to enter one of 
thirty-two preprogrammed constants onto the DA bus. 


Figure 31B is predominately the data handling portion of the 
design. Here, U20 and U21 represent a data register that re- 
ceives data from the data bus. U26, U27, U28 and U29 are the 
four Am2903’s that form a 16-bit register/ALU combination. U30 
Is the carry look ahead generator for the ALU section. U22, U23 
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Figure 29. Am2903 with Shift Mux and Status Register. 


113 


DATA BUS 










INSTRUCTION 
REGISTER 










| 


DATA OUT 
REGISTER 


DATA IN 
REGISTER 


CONDITION 
CODE 
MUX 

















MICROPROGRAM 
CONTROLLER 
Am2910 


DECODER 
CONSTANT 
PROM 








F 
L 
A 
G 







FUTURE 


REGISTER 
EXPANSION . 









MICROPROGRAM MEMORY 













27813 
PROM’S 


29775 
PROM’S 


4 
ge 
x2 

o 
Ow 
<a 





SUPER SLICES 


2 
SHIFT 
CONTROL 
PIPELINE PIPELINE 
REGISTER REGISTER 


REGISTERS 


CONTROL 
BITS 


CONTROL 
BITS 


Figure 30. 


vit 


MPR-546 


ADDRESS BUS 


and U24 represent the status register with the ability to save and 
restore the flags in main memory. U25 is the condition code 
multiplexer for the microprogram sequencer. U33, U34, U35 and 
U36 represent the shift linkage multiplexers that tie together the 
internal shifters within the Am2903’s. U37 is part of the pipeline 
register and provides both true and complement outputs of a 
number of the microprogram bits. U38 is part of the carry in logic 
control such that double length arithmetic operations can be 
performed. U31 and U32 are the data out register that can be 
used to accept data from the Am2903s and enable this data onto 
the data bus. U39 and U40 represent the memory address regis- 
ter and are used to hold the address provided from the CPU to 
main memory. 


The microprogram store is shown in Figure 31C. Here, we have 
used both the 512 x 8 registered PROM’s and 512 x 4 non- 
registered PROM’s in this design. A total of 68 microprogram bits 
have been depicted in this design. These are shown so that 
maximum flexibility is achieved. In most typical designs some 10 
to 20 of these bits would not be used. Figure 31C shows four 
512-word by 8-bit registered PROM’s (U41, U42, U43 and U44). It 
also shows nine 512-word by 4-bit PROM’s represented as U45 
through U53. 


Perhaps the best way to review the design is to simply understand 
the function of each of the microprogram control bits. If the pur- 
pose of each of these bits is well understood, the design engineer 
will be well along in understanding the design of the simple 
minicomputer CPU presented here. 


The Microprogram Structure 


The microprogram for the design shown in Figure 31 is 68 bits 
wide. The functions of the microprogram control bits are as fol- 
lows: 


Bits PLO The 9 instruction bits of the Am2903 super- 

through PL8 _ slices. 

Bits PL9, The IEN, EA, OEB control inputs of the 

PL10, PL11 Am2903 superslices, respectively. PL11 is also 
connected to the data-in registers (U20 and 
U21) output-enable. This connection assures 
that there will be no conflict on the DB pins. 

Bits PL12 Select the source for SIO of the Am2903, both 


through PL14 for shift-up and for shift-down operations. The 
(412 through following table summarizes the functions of 
p14) these bits. 


Microprogram Bits 
14 13 12 


SIO, SIO, 
(Shift-down) (Shift-up) 


0 0 
SIO, SIO, 
QlO, 


QIO, 

Carry Carry 

Zero Zero 

Sign Sign 

Not allocated Not allocated 
1 1 


Bits PL15 Select the source for QIO of the Am2903, both 
through PL17 for shift-up and shift-down operations. The fol- 
(u15 through lowing table summarizes the functions of 


p17) these bits. 








Microprogram Bits aio, Qlo, 
17 16 15 (Shift-down) (Shift-up) 




















0 0 
SIO, SIO, 
QIO, QIO, 
Carry Carry 
Zero Zero 
Sign Sign 


Not allocated 
1 


Not allocated 
1 





Bit PL18 


When LOW, enables the MAR clock input, i.e. 
the data appearing on the Y output pins of 
the Am2903 Superslices™ will be clocked into 
the MAR at the LOW-to-HIGH transition of 
the clock pulse. 


When LOW, enables the MAR output onto the 
Memory Address Bus. 


When LOW, enables the data output register 
clock, i.e. the data appearing in the Y output 
pins of the Am2903 Superslices™ will be 
clocked into the data output registers (U31 
and U32) at the LOW-to-HIGH transition of 
the clock pulse. 


Bit PL19 


Bit PL20 


Bit PL21 When LOW, enables the data output registers 


onto the Data Bus. 


Bit PL22 When LOW, enables the data-in register clock, 
i.e. the data appearing in the Data-Bus will be 
clocked into the data-in registers at the 


LOW-to-HIGH transition of the clock pulse. 


This is the Cl input of the Am2910 micropro- 
gram sequencer. 


Bit PL23 


Bits PL24 
through PL27 


This is a 4-bit wide field which can be used 
either for the A-address, for the B-address or 
for both A and B addresses of the Am2903 
superslices. 


Bits PL28 
through PL31 


This is a 4-bit wide field, which can be 
used for either the A-address of the Am2903 
superslice or to designate one of sixteen bits to 
the DA inputs of the Am2903 superslice via the 
Am2921’s («16 and 17). 


Select the source for the Am2903 A-address, 
according to the table below: 


Bits PL32 
and PL33 


de el A-Address Source 


Data Bus bits 0 through 3 
Microprogram bits 28 through 31 
Data Bus bits 4 through 7 
Microprogram bits 24 through 27 







Bit PL34 Selects the source of the Am2903 B-address, 


according to the table below: 


es B-Address Source 
L Data Bus bits 4 through 7 
H Microprogram bits 24 through 27 
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Bit PL35 


Bits PL36 
and PL37 












Bit PL38 


Bit PL39 


Bit PL40 


Bit PL41 


Bit PL42 


Bit PL43 


Bit PL44 


Bit PL45 


Bit PL46 


and PL47 
Bits PL48 


through PL50 


50 


L 
L 
L 
L 
H 
H 
H 
H 


Bits 
49 





Is the C,, input of the least significant Am2903 
via an Am74S157 mux (438). 


Affect the status register input signals, ac- 
cording to the table below: 


Previous Zero, 
Sign, Overflow 


Previous Carry 








Previous SIO;5 Previous Zero, 


Sign, Overflow 


Am2903 superslices’ Output 
Data Bus bits 0 through 3 


Selects either the carry flip-flop or the PL35 bit 
for carry in. 


When LOW, enables the status register output 
to the data bus bits 0 through 3. 


Controls the output polarity of the one-of-six- 
teen bit select logic. 


When LOW, enables the Instruction register 
(U1, U2, U3) clock. The data present at bits 0 
through 15 of the Data-Bus will be latched into 
the Instruction register at the next LOW-to- 
HIGH transition of the clock pulse. 


This is an output signal. When HIGH, it signals 
the main memory that a memory read is re- 
quested. 


This is an output signal. When HIGH, it signals 
to the main memory that a memory write is re- 
quested. 


Selects the source of the one of sixteen bit de- 
coders (U16 and U17) . When LOW, the output 
of the Am2919 register (U12) containing the pre- 
viously latched microprogram bits 28 through 
31 will be applied to the decoders. When HIGH, 
the output of the Am2919 register (U3) con- 
taining the previously latched Data-Bus bits 0 
through 3 will be applied to the decoders. 


Selects the Am2903 Superslices™’ DA port 
source. When LOW, the output of the one of six- 
teen bit decoder (U16 and U17) will be applied 
to that port. When HIGH, the output of the 
Am29771 PROM’s (U18 and U19) will be ap- 
plied to the Am2903 DA ports. 


These are the RLD and CCEN control inputs 
of the Am2910 sequencer, respectively. 


These select the condition code according to 
the following table: 


Condition Code Selected 
48 


Carry 
Sign 
Zero 
Overflow 


Not Allocated 


ae ee ek i, Ie 


Bit PL51 Is the condition code polarity control. When 
HIGH, the condition code selected will pass non- 
inverted. When LOW, the selected condition 
code will be complemented. 

Bits PL52 Are the | inputs of the Am2910 sequencer. 

through PL55 

Bits PL56 This is a 12-bit wide field and it serves, usu- 


through PL67_ ally as the next microprogram address.How- 
ever, the 5 least significant bits of this field (bits 
56-60) serve also as an address field of the 


Am29771 “constant” PROM’s (U18 and U19). 


Some Sample Microroutines 


Figure 32 shows the microprogram code for a few sample micro- 
routines. Different addressing schemes are demonstrated with 
the “ADD” operation. All the other arithmetic or logic operations 
can be easily programmed by substituting the |,-l, field of the 
Am2903 with the appropriate function. Since the main memory 
address is generated by the Am2903 superslices, the internal 
register No. 15 serves as the program counter. 


The following is a description of some sample microroutines. The 
reader should refer to the description of the microprogram bits 
given earlier in this chapter and to the data sheets of the Am2910 
sequencer and of the Am2903 superslice. 


Microword INIT. 


This microword should be at address 0 and when the machine is 
reset, the Am2910 will start executing from here. The purpose of 
this location is to reset the machine program counter (Register 
15) to zero. Ultimately more microinstructions can be added, 
should the necessity of other reset functions arise. 


Bits 1-4 (Am2903 1,-I4) being 8,, will cause the superslices to 
generate all zeroes at the F-points (internal). Bits 5-8 (Am2903 
I5-lg) being F,, will cause this data (all zeroes) to appear on the Y 
outputs. Bit 9 (IEN) is LOW and therefore, WRITE will be LOW 
and this data will be written into the internal register selected by 
the B-address inputs. Bit 34 is HIGH; therefore, microprogram 
bits 24-27 will be selected as B address source. Since Fy, is in 
these bits, all zeroes will be written into the program counter 
(Register 15). Bit 18 is LOW; therefore, the data at the Y outputs 
(all zeroes) wil be latched into the MAR at the next clock pulse. 
Bits 36 and 37 are set such that the flags will be updated, namely 
CY=N=OVF=0, Z=1. 


Bits 42, 43 are both LOW so no memory reference signal is sent to 
the main memory (the MAR is still in an undetermined state). Bits 
52-55 (Am2910 |) are set to E,, which will force the sequencer to 
continue to the next sequential address (1) as the Ci (bit 23) is 
HIGH. 


Bits 21 and 39 are both HIGH to ensure that there is no conflict on 
the data bus though in this case one of them could be a DON'T- 
CARE. Bit 38 could also be aDON’T-CARE as the carry is zeroed 
by the ALU. Making a HIGH in bit 46 enables executing this 
microstep without disturbing the Am2910 sequencer’s internal 
register which at power-up has no significance but may be impor- 
tant, should a software restart be issued. 


All the other bits are DON’T-CAREs. 


Microword FETCH 


This is the first step in the machine instruction fetch routine. In this 
step, the main memory is addressed by the MAR, a read signal is 
issued (bit 42 = HIGH), and the machine instruction (mac- 
roinstruction) is placed on the data bus by the memory. It is 
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Figure 32. Example Microcode for Figure 31 Design. 


Don’t Care. 


1. 4-bit fields in hex, others in octal. 


2. X 
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latched into the instruction register (U1, U2, and U3) at the next 
clock LOW-to-HIGH transition (bit 41 = LOW). It is assumed that 
if a relatively slow main memory is used, the clock is halted until 
the data is stable on the data bus and the register set up times are 
met. We will see in a later chapter how easy it is to implement this 
requirement using the Am2925 clock generator. The same as- 
sumption will also be made in a memory write cycle. 


Bit 9 (Am2903 IEN) is HIGH; thus, we don’t care what the ALU 
does during this microstep. We prevent the flags from changing 
by setting bits 36-38 LOW. Also, the registers atthe Y output have 
the E input HIGH (bits 18, 20). Bits 21 and 39 are both HIGH; thus, 
the data bus is free to accept data from the main memory (bit 42 is 
HIGH, signaling memory read request). The MAR is enabled to 
the address bus (bit 19 = LOW) and at the next clock, the 
macroinstruction will be latched into the instruction registers (bit 
41 = LOW). The Am2910 sequencer will continue to the next 
instruction (bits 52-55 = E,,). 


Microword FETCH + 1 


This is the second step in the macroinstruction fetch routine. The 
instruction already resides in the instruction registers U1, U2 and 
U3). 


The Am2910 sequencer receives a JUMP MAP instruction (bits 
52 though 55 = 2). The next microinstruction will begin to execute 
the present macroinstruction — according to the mapping PROM. 


We use this microstep to update (increment) the program counter 
(Register 15). Bit 34 being HIGH, microprogram bits 24-27 (=F) 
will be the B address. The Am2903 OEB and Ip are LOW, there- 
fore, the contents of Register 15 will serve as the S operand for 
the ALU. C, being HIGH, a 4 in the 14-14 field will increment this 
value. IEN = LOW with I;-lg = F will write this (incremented) value 
into the same register (R15). At the same time, the MAR is also 
updated (bit 18 = LOW). 


We could update the program counter and the MAR in the previ- 
ous microstep (location FETCH), but then we had to leave the 
ALU idle during this microcycle. By adopting the present scheme, 
we can overlap the first step of the macroinstruction fetch routine 
(the memory-read cycle) with the execution of the last step of the 
previous macroinstruction — provided the memory and the data 
bus are free to perform it. The JUMP MAP cycle is always neces- 
sary — and that is why we prefer to update the PC at this step. 


Microword ADD 


This is a sample register-to-register operation. The two operands 
reside in the internal registers pointed to by the two 4-bit fields of 
the macroinstruction: 


15 8 7 43 0 


1st Operand and 


2nd Operand 


OPCODE Register Number 


Destination Register 
Number 





Bits 32-33 are set LOW, instruction register bits 0-3 are selected 
as A address. Bit 34 = LOW selects instruction register bits 4-7 as 
B address (see Fig. above). Bit 1 (Ig), bit 10 (EA) and bit 11 (OEB) 
are also LOW; therefore, the contents of the selected registers 
will be presented to the ALU’s R and S inputs. Bits 1-4 (I,-I4) = 3, 
the ALU will perform: 


F = R plus S plus C,,. 
Note that bit 35 and 38 are LOW. With I,-lg (bits 5-8) = Fy, and IEN 


(bit 0) = LOW, the result will be written into the internal register 
pointed at by the B address lines. 


Bits 18 and 20 are HIGH and inhibit the MAR and the data out 
registers from being affected, while bits 36, 37 (=2) allow the 
flags to assume values according to the result of the operation. 


During the execution of the function required (ADD in this exam- 
ple) we fetch the next OP CODE from the main memory. The 
MAR is enabled to the address bus (bit 19 = LOW) anda memory 
read is requested (bit 42 = HIGH). At the end of this microstep the 
next macroinstruction will be latched into the instruction registers 
(bit 41 = LOW). 

The Am2910 sequencer is instructed to select the pipeline regis- 
ter bits 56-67 as the next microprogram address (bits 52-57 = 7, 
bit 47 = HIGH) where the location of FETCH + 1 (2 in this 
example) is written. The next step will be JUMP MAP and update 
PC. 


Microword ADD IMMEDIATE 


This 2 step microroutine adds the contents of an internal register, 
pointed at by bits 0-3 of the macroinstruction with its second word, 
placing the result into the internal register pointed at by bits 4-7 of 
the OPCODE. 


15 8 7 43 0 


Result 
Register Number 


2nd Operand 
Register Number 


OPCODE 





First word of the macroinstruction 


DATA (1st Operand) 





Second (next consecutive) word of the macroinstruction 


The first step is to read the first operand from the memory (bit 19 
= LOW, bit 42 = HIGH) and to latch it into the data-in register 
(U20 and U21) (bit 22 = LOW). At the same time the ALU updates 
(increments) the program counter (register 15) and the MAR (bit 
18 = LOW). (Compare the location FETCH + 1 ). The Am2910 
sequencer will continue to the next microprogram address (com- 
pare to location FETCH). 


Location ADDIMM + 1 is the second step of this macroinstruc- 
tion. Itis very similar to location ADD, the only difference is that bit 
11 (OEB) is HIGH, selecting the Data-in register as source for the 
ALU’s S operand. The same macroinstruction fetch overlap 
technique is used again. 


Microword ADD DiRect 


This is the starting location to execute a macroinstruction where 
the second word is the address of the operand: 


15 8 7 43 0 


Result 
Register Number 


2nd Operand 
Register Number 


OPCODE 





First word of the macroinstruotion 


Address of the 1st operand 





Second (next consecutive) word of the macroinstruction 


The first step is to read the second word of the macroinstruction 
into the Data-in register. This microword is identical to the one 
written at location ADDIMM. 


Microword ADD DIR + 1 


The Data-in register now contains the address of the operand. 
We have to transfer it into the MAR. 


With Ip (bit 0) LOW and OEB (bit 11) HIGH, the ALU’s operand will 
be the DB bus, i.e., the Data-in register. 1,-l, (bits 1-4) = 4 will 
pass this input to its output, as C,, (bit 3) is LOW. With IEN (bit 9) = 
HIGH, the WRITE line will be HIGH too, assuring that the internal 
registers maintain their contents. Since I5-lg (bits 5-8) = Fy, the 
ALU output will appear on the Am2903 Y pins. This data which is 
actually the operand address and will be transferred into the MAR 
at the next clock cycle. The Am2910 sequencer continues to the 
next consecutive microstep. 


Microword ADD DIR + 2 


Now we read in the operand from the main memory. The MAR is 
enabled to address bus (bit 19 = LOW), a memory read signal is 
issued (bit 42 = HIGH) and the data-in register’s clock is enabled 
(bit 22 = LOW). At the next LOW-to-HIGH transition of the clock, 
the operand will be placed in the data-in register. 


Meanwhile, we need to restore the address of the next mac- 
roinstruction in the MAR. Bits 32-33 = 3 select microprogram bits 
24-27 as the A address (an Fy is written there); therefore, the 
internal program counter will be addressed, as EA (bit 10) = 
LOW. The ALU performs an F = R + C, with C, (bit 35) LOW, 
thus passing the program counter contents to the output. IEN (bit 
9) = HIGH prevents disturbance of internal Am2903 registers and 
bit 18 will enable the MAR to receive the next macroinstruction 
address. 





Note that the situation now is exactly the same as after the first 
step of ADD IMMediate. The operand is in the data register and 
the MAR points to the next macroinstruction. Therefore, the 
Am2910 sequencer will address, as the next microstep, location 
ADDIMM + 1. The step after this will, of course, be FETCH + 1.A 
total of 5 microsteps were needed to execute this macroinstruc- 
tion but it occupies only 3 microprogram locations. 


It is worthwhile to note here that by adding two more Am2920 
registers between the Data-bus and the Address-bus and a 
couple of control-bits in the microprogram, we could shorten the 
microprogram by one step. In this design we chose not to do so in 
order to demonstrate the Data-bus to Address-bus path through 
the ALU. 


Microword ADD RR1 


The macroinstruction to be excuted here points to the register in 
which the first operand is written, and also into which the result 
should be written. The second 4-bit field of the OP-CODE (bits 
0-3) points to the register in which the address of the second 
operand is stored. 


15 8 7 43 0 


1st Operand and 


2nd Operand’s 
Address Register 
Number 


OPCODE Result Register 


Number 





Bits 32 and 33 are LOW. Therefore, instruction register bits 0-3 
will form the A-address. Now we take the contents of this register 
and place it in the MAR exactly the same way as we did in location 
ADD DIR + 2 with the program counter. The Am2910 continues. 


Microword ADD RR1 + 1 


Here we fetch the operand and place it in the Data-in register. At 
the same time, we restore the program counter into the MAR. 


Microword ADD RR1 + 2 


Bits 32, 33 = 2 and instruction register bits 4-7 serve as the 
A-address. Bit 34 = LOW; the same instruction register bits serve 
as B-address, too. Note, that OEB (bit 11) is HIGH; therefore, the 
ALU R-source will be the Data-in register and the S-source will be 
the register addressed by A-address. The result (sum), however, 
will be written to the correct register, as IEN (bit 9) is LOW. 


At the same time, the next macroinstruction is fetched in the 
usuall oooverlapping way and the next microinstruction to be 
excuted will be at location FETCH + 1. 


Summary 


In this design shown in Figure 31, we have demonstrated some of 
the addressing schemes mentioned in Chapter 1. We used the 
ADD instruction throughout these examples, but any other arith- 
metic or logic instruction can be executed, in exactly the same 
manner by changing the microcode bits 1-4 to the appropriate 
ALU code. 


The reader is encouraged to write several microcode-lines to 
execute the other addressing modes mentioned in Chapter 1. He 
will discover that when the result of the macroinstruction is to be 
written into main memory, the overlapping instruction-fetch is not 
feasible. In some cases, when the MAR no longer contains the 
Program Counter value, an additional microstep is needed in 
order to restore the Program Counter into the MAR. The reader is 
again encouraged to modify location FETCH in order to save this 
additional microstep. 


Appendix 


Throughout Chapter 3, a number of AC calculations have been 
made to show typical speeds for an Am2901A and Am2903 16-bit 
ALU configuration. This Appendix shows the latest SWITCHING 
CHARACTERISTICS for the Am2901A and Am2903. 


The typical data on the Am2901A shown in this Appendix super- 
sedes that shown on page 2-12 of the Am2900 Family Data Book 
dated 4-78 (AM-PUB003). The only difference between the data 
shown in the typical column of the switching characteristic and 
this Appendix appears in Table 3. The typical carry in set-up time 
should be 40ns. 


The typical switching characteristic data for the Am2903 as 
shown in this Appendix supersedes the data presented in the 
Am2903 Bipolar Microprocessor Slice/Am2910 Microprogram 
Controller Data Booklet dated 3-78. Here, a number changes 


have been made to the table for both the combinatorial propaga-: 


tion delays and the set-up and hold times. 


Should any questions arise concerning the switching characteris- 
tics for either the Am2901A or Am2903, please do not hesitate to 
contact the AMD factory and ask for Bipolar Microprocessor 
Marketing or Bipolar Microprocessor Applications. 
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Am2901A - (MAY 18, 1978) 


ROOM TEMPERATURE TABLE I 
SWITCHING CHARACTERISTICS 


(See next page for AC Characteristics over operating range.) 


Tables |, Il, and Ill below define the timing characteristics of 
the Am2901A at 25°C. The tables are divided into three types 
of parameters; clock characteristics, combinational delays 
from inputs to outputs, and set-up and hold time require- 
ments. The latter table defines the time prior to the end of the 
cycle (i.e., clock LOW-to-HIGH transition) that each input must 
be stable to guarantee that the correct data is written into one 
of the internal registers. 


All values are at 25°C and 5.0V. Measurements are made at 
1.5V with V;, = OV and V\, = 3.0V. For three-state disable 
tests, C, = 5.0pF and measurement is to 0.5V change on 
output voltage level. All outputs fully loaded. 


CYCLE TIME AND CLOCK CHARACTERISTICS 


TIME TYPICAL GUARANTEED 
Read-Modify-Write Cycle 
(time from selection of 55ns 
A, B registers to end of 
cycle) 


Maximum Clock Frequency to 
Shift Q Register (50% duty 40MHz 20MHz 
cycle) 


TABLE II 
COMBINATIONAL PROPAGATION DELAYS (all in ns, CL = 50pF (except output disable tests)) 


TYPICAL 25°C, 5.0V | GUARANTEED 25°C, 5.0V 














A bypassing 
ALU (I = 2xx) 


Clock _# (Note 6) 





SET-UP AND HOLD TIMES (all in ns) (Note 1) TABLE Ill 


A,B 2,4 
Source 3,5 


| 24 | 
[Dlanithmetie modell 
DW= x37) (Note 5) 
e acess 
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a ae 
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Set-Up Time Hold Time 


tpwl + 15 


93 
| wt | 
7 RR 








NOT RM 
ato 


oO 


Ww 
© 


towL + 15 


—_— 
on 


fo) 





Notes 1. See next page. 

. If the B address is used as a source operand, allow for the ‘‘A, B source”’ set-up time, if it is used only for the destination address, use the 
““B dest.’’ set-up time 

3 Where two numbers are shown, both must be met. 

4 “towl" ts the clock LOW time 

5 

6 


N 


DV 0 ts the fastest way to load the RAM from the D inputs. This function is obtained with | = 337 
Using Q register as source operand tn arithmetic mode Clock is not normally in critical speed path when Q is not a source. 


A. Am2903 SWITCHING CHARACTERISTICS (TYPICAL ROOM TEMPERATURE PERFORMANCE) — (MAY 18, 1978) 


Tables IA, IIA, and IIIA define the nominal timing characteris- 


tics of the Am2903 at 25°C and 5.0V. The Tables divide the TABLE IA — Write Pulse and Clock Characteristics 
parameters into three types: pulse characteristics for the : 
clock and write enable, combinational delays from input to Time 


output, and set-up and hold times relative to the clock and 
write pulse. 


Measurements are made at 1.5V with V,;, = OV and Vj, = 
3.0V. For three-state disable tests, C, = 5.0pF and mea- 
surement is to 0.5V change on output voltage level. 





TABLE llA — Combinational Propagation Delays (All in ns) 
Outputs Fully Loaded. CL = 50pF (except output disable tests) 


SIOp 
From ipa | ¥ | Cua | BF [92 N | vA | vB | WATE | al0y aI0, | sid, | Si0s |Party 
tm fe} | © |-ia} mie | - | fe) ol] 
teem 18] > [| *|-is)- |e] - | - [e]e lo 
Coa OC ees ne 
eis he ee ee 
Gn ee OE) ee a ee ee a ee 
es Cee 
pr | TT | ee | - Dm | ot 
en 
EN a ee he are ee 
OEY Enable/Disable 
SI0,, SIO; 
Clock 
Y 
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Notes 1. Applies only when leaving special functions. 
2. Enable/Disable. Enable is defined as output active and correct. Disable is a three-state output turning off. 
3. For delay from any input to Z, use input to Y plus Y to Z. 


TABLE I11A — Set-Up and Hold Times (All in ns) 
CAUTION: READ NOTES TO TABLE Ill. NA = Not Applicable; no timing constraint. 


HIGH-to-LOW LOW-to-HIGH 
With Respect to 
nput to this Signal 
WE HIGH To Prevent Writing 


Y 
NA 


3 
Pa aad 





—s 
[oe ) 
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Chapter IV 
The Data Path — Part Il 








CHAPTER IV 
THE DATA PATH 


The previous CPU example (See Chapter Ill) utilized SSI and MS! 
components to accomplish the shift-linkage, carry control, and 
status register functions associated with the ALU. These func- 
tions can ail be implemented with the Am2904 status and shift 
control unit. 


The Am2904 is an LSI device that contains all the logic necessary 
to perform the shift and status control operations associatéd with 
the ALU portion of a microcomputer. These operations include 
storage for ALU status flags; carry-in generation and selection; 
data-path, carry bit linkage for shift/rotate instructions; and status 
condition code generation and selection. The ALU status flags: 
Carry, zero, negative, and overflow; may be stored in either of two 
registers, amachine status register or a micro status register. The 
carry-in multiplexer can select the true or complement of the 
microstatus carry flag or machine status carry flag, as well as an 
external carry, a logical one, or a logical zero. The shift linkage 
multiplexers provide paths to rotate/shift single and double length 
words up, down, around the carry flag, and through the carry flag. 
The status condition code multiplexer provides tests on the true or 
complement of any status flag, as well as more complicated 
logical combinations of these flags to facilitate magnitude com- 
parisons on unsigned and two’s complement numbers, and nor- 
malization operations. ; 


STATUS REGISTERS 


The status registers contained in the Am2904 are shown in the 
upper portion of Figure 1. Each register is independently con- 
trolled by a combination of instruction signals and enable signals. 


MICRO STATUS REGISTER (uSR) 


The SRis enabled when the CE signal is low. When CE is low 
the instruction present on Is through Ip will be executed on the 
LOW to HIGH transition of the Clock input. These instructions fall 
into three main categories: Bit Operations, Register Operations 
and Load Operations. 


The bit operations allow individual bits of the ~4SR to be set or 
reset. (See Table 1.1). 


The register operations allow the “SR to be loaded from the 
machine status register, to be set to all one’s, reset to all zero’s, or 
swapped with the machine status register. (See Table 1.2). 


The load operations allow the «SR to be loaded from the | inputs 
directly, from the | inputs with |< complemented, or from the | 
inputs with overflow retained, love + Hover > Hover (See Table 
1.3). The load operation with |; complemented can be used to 
emulate machines which use direct subtraction and thus need to 
complement the carry to obtain a borrow. The load with overflow 
retained allows a series of arithmetic instructions to be executed 
without the need for a check for overfiow after each instruction. If 
an overflow occurred at any time during the series it will be 
“trapped.” Thus a single test for overflow, at the end of the series, 
is all that is required. 


MACHINE STATUS REGISTER (MSR) 


The MSR is enabled when CEy, is low. If CEy is low the in- 
struction present on I, through Ip will be executed on the LOW 
to HIGH transition of the Clock input. Additionally the individual 
bits of the MSR may be selectively enabled through the use of 
the Enable inputs Ez, Ec, Ey and Eoyr, (See Figure 1). This 
allows all possible combinations of the four status flags to be 
selectively operated on for maximum flexibility. Thus the in- 
struction specified by Is-lp9 only effect the enabled status flags. 





12, IN. Ic. lovr\ 


MACHINE 
STATUS STATUS 
REGISTER REGISTER 


Mc Mz Mn Movra 


'" OEN- 
love TO 
IC InMcMyMc 


“INTERNAL 


Figure 1. Am2904 Block Diagram. 


The MSR instructions fall into two main categories: register op- 
erations and load operations (bit operations can be implemented 
through the use of the selective enable control lines). 


The register operations allow the MSR to be loaded from the 
bi-directional Y port, or the wSR. Additionally the MSR may be 
set, reset, or complemented (See Table 2.1). These three in- 
structions, combined with the selective enables, allow any com- 
bination of MSR bits to be set, reset, or complemented. 


The load operations allow the MSR to be loaded directly from the | 
inputs, from the | inputs with |p complemented, or from the | inputs 
for shift through overflow (See Table 2.2). The load with Ic com- 
plemented can be used to produce a borrow. The load for shift 
through overflow loads the zero flag and the negative flag from 
the | inputs while swapping the overflow and carry flags. This 
allows the shift through overflow operation to be easily im- 
plemented. 


SHIFT LINKAGE MULTIPLEXERS 


The shift linkage multiplexers control bi-directional shift lines 
SlOn, SIOg (RAM shifter on the Am2903) and QIOn, QiO9 (Q 
register shifter on the Am2903). To enable the shift linkage mul- 
tiplexers the shift enable line SE must be low. When SE is low the 
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TABLE 1. MICRO STATUS REGISTER 


INSTRUCTION CODES. 
1543210 
Octal 


Table 1-1. Bit Operations. 


Operation 



















0 > py RESET ZERO BIT 

1 > pz SET ZERO BIT 

0 > uc RESET CARRY BIT 

1 > uc SET CARRY BIT 

0 > un RESET SIGN BIT 

1 > uN SET SIGN BIT 

0 > vovrR RESET OVERFLOW BIT 
1 noun SET OVERFLOW BIT 





Table 1-2. Register Operations. 


LOAD MSR TO uSR 


SET uSR 
REGISTER SWAP 
RESET «SR 





Table 1-3. Load Operations. 


1543210 “SR 
Octal Operation 
















06, 07 lz > pz 
lc > TT) LOAD WITH 
iy, Se aie OVERFLOW RETAIN 


lovr + Hove > Hover 
lz > wz 
lc > xe 
In > #N 
lovrR > HovrR 
lz > pz 
lc > uC 
In > BN 
love > HovR 














LOAD WITH 
CARRY INVERT 











LOAD DIRECTLY 
FROM 


Iz, Ic, In, lover 







Note’ The above tables assume CE is LOW. 


shift linkage data path will be set-up depending on the state of 
instruction lines Iq through Ig (See Table 3). These instructions 
allow single length or double length shifts/rotates either up, or 
down. Additionally shifts/rotates may be done through or around 
the MSR carry and negative flag. Special operations exist to 
provide support for add and shift (multiply) instructions. These 
instructions select the present carry Ic (for unsigned multiply), 
or the Exclusive-OR of the sign flag |,, with the overflow flag 
love (for two’s complement multiplication). 


CONDITION CODE MULTIPLEXER 


The condition code multiplier selects one of sixteen possible 
logical combinations of the «SR, MSR or | inputs, depending on 
the state of the I5-I9 input lines. These combinations include the 
true or complement form of any individual bitin the ~4SR, MSR or I 
inputs. Additionally several more complicated logical operations 
may be performed to provide magnitude tests on both two's 


complement numbers and unsigned numbers. Table 5 lists the 
conditional test outputs (CT) corresponding to the state of the I,-Ip 
instruction lines. Table 6 lists the possible relations between two 
unsigned or two’s complement numbers and the corresponding 
status and instruction codes. The three-state conditional test 
output CT is active only if OEcr is low. 


CARRY IN MULTIPLEXER 


The Carry output can be selected from one of seven different 
sources depending on the state of instruction input lines. The 
seven possible sources are: logical zero, logical one, the «SR 
carry flag, the complement of the wSR carry flag, the MSR carry 
flag, the complement of the MSR carry flag, or the external carry 
input Cy, (See Table 4). 


TABLE 2. MACHINE STATUS REGISTER 
INSTRUCTION CODES. 


Table 2-1. Register Operations. 


1543210 
Octal eas 


LOAD Yz, Yc, Yn; Yovr 
TO MSR 

SET MSR 

REGISTER SWAP 
RESET MSR 


INVERT MSR 














Table 2-2. Load Operations. 


1543210 
Octal sce 


LOAD FOR SHIFT 
THROUGH OVERFLOW 
OPERATION 







Iz > Mz 












lc > Mc LOAD WITH 
In 2 Mn CARRY INVERT 
lovr — Mover 








Iz —_> Mz 
Ic —_-> Mc 
In > Mn 
lovr — Movr 







LOAD DIRECTLY 
FROM lz, Ic 
In» love 






Note: 1. The above tables assume CEy, Ez, Ec, Ey, Eovr are LOW. 


Y INPUT/OUTPUT LINES 


The bi-directional Y data lines may be used for extra data input 
lines when the Y output buffer is disabled (OEy high). 
Additionally, when I5-lp are low, the Y buffer is disabled, irre- 
spective of the OEy signal. When the Y buffer is enabled (OEy 
is low) the Y data lines are selected from the MSR, wSR, or | 
input lines depending on the state of instruction lines I, and I, 
(See Table 7). 
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TABLE 3. SHIFT LINKAGE MULTIPLEXER INSTRUCTION CODES. 


Loaded 
into Mc 





SiO, | siO, | alo, | alo, 


| 


In ® love 


| 
| 


N N N N N N N N N N N N N N N N 


N N N N N N N N N N N N N N NWN 


MSB LSB MSB LSB 


ae ae a ae 
Os ees pan i at 


O Se Se 
O -~S: 4: 


L] 


M0 Hf 


{ 


L] 


ay 
Hy ye 


Liked 


gag 


° 


Ee. 2 
Ny 
! 


mins 
‘|| (i 
‘|| 


Notes 1. Z = High impedance (outputs off) state. _ 3. Loading of Mc from I19.¢ overrides control from Is.o, CEm. Ec. 
2. Outputs enabled and Mc loaded only if SE is LOW. 


134 


TABLE 4. CARRY-IN CONTROL MULTIPLEXER INSTRUCTION CODES. 


(Hn © Hover) + Hz (un © Hover) + Hz (Mn @® Mover) + Mz (In@® love) + Iz 
(HnQHoveA) * 2z (u4NOQHovR) * Zz (My® Moyr) * Mz (IN© lor) *!z 
Ln ® Hove Hn ® Hove Mn ® Mover In® lovr 
LNOHKOVR LNOHOVR INO lovr\ 
Kz Mz 
Ez Bz 

KOVR love 

Zovr Tovr\ 

Uc + Kz Te + Iz 

Bc * Bz Mc°M: Ioelz 

Kc 

Kc 

Kc + Mz 

kc * Hz 

LN 

En 
Notes 1 @ Represents EXCLUSIVE-OR ©) Represents EXCLUSIVE-NOR or coincidence. 


mMmUODWPFHOAN ODOMNA WH =] O 
=a--64 4 sa=a-Aauni OOoododoaoaoa§oo 


—~ = «=—=& =—=& ©O©F Ood9;dod =F = = =& Oo d Oo 
—- =- OO fa 27t 00+ 2002+ + 0 8 
“~~ O--"- OoO- 0-7 OO =| CO $$ OO =| Os CO 
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TABLE 6. CRITERIA FOR COMPARING TWO NUMBERS FOLLOWING “A MINUS B” OPERATIONS. 


N@OVR = 1 

N @ OVR=1 
(N@OVR)*Z = 1 
(N @ OVR) +Z=1 








@ = Exclusive OR H = HIGH Note For Am2910, the CC input is active LOW, so use I3_,) code to produce 
@) = Exclusive NOR L = LOW CT = L for the desired test. 





TABLE 7. Y OUTPUT INSTRUCTION CODES. 


Y Output 
Output Off 


v 


Z 
a 
pee a Ss ae 
Notes’ 1. For the conditions. 
Is, 14, 13, Io, 14, Ip are LOW, Y Is an input. 


OEy ts “Don't Care” for this condition. 
2. X is “Don’t Care” condition. 










TIMING ANALYSIS 


In the previous chapter a timing analysis was presented with the 
shift-linkage, carry-control, and status registers implemented in 
SSI and MSI. This timing analysis will be repeated with the SSI 
and MSI logic replaced with the Am2904. Tables 8.1, 8.2, 8.4 and 
8.5 list the typical AC characteristics of the registers, Am2902A, 
Am2901A, Am2903, and Am2904 used in these calculations. 
Table 8.3 lists the assumed AC characteristics for the set-up time 
of the Am2904. 


Figure 2 illustrates the timing analysis for an Am2901A based 
design. The analysis begins with the LOW to HIGH transition of 
the system clock. All signals must be valid for the next LOW to 
HIGH transition of the system clock, i.e. one-microcycle later. 


Figure 3 illustrates a similar timing analysis for the Am2903. The 
results of both analysis are listed in Table 9. 


USING THE Am2904 IN A 16-BIT DESIGN 


Perhaps the best technique for understanding the Am2904 is to 
simply compare 16-bit ALU designs with and without the 
Am2904. The first design, Figure 4a, is an example of a 16-bit 
CPU design using SSI/MSI parts instead of the Am2904. In 
Figure 4b, the second 16-bit CPU design, the Am2904 is shown 
replacing the SSI/MSI. The Am2904 substitutes for the appro- 
priate shift matrix control and status registers. A more detailed 
comparison may be obtained by referring to the 16-bit ALU de- 
signs in Chapter III and the one in Appendix C of this chapter. To 
understand the Am2904 further, the usage of the Am2904 is 
described through the microprogram bits in the microprogram 
structure and shown later in the actual microprograms. 


TABLE 8-1. STANDARD DEVICE SCHOTTKY SPEEDS. 


[Beviee and Pain [win [Typ | Max 


S-REGISTER 
Clock to Output 
OE to Output 
Set-up 


Am2902A 
Cn to Cn+x, Y, Z 
G, P to G, P 

G, P to Cn+x, Y, Z 















TABLE 8-2. 
PRELIMINARY SWITCHING CHARACTERISTICS. 


Combinational Delays (ns) 






From (Input) To (Output) tsa 
Iz Yz 

IN Yn 

love Yovr 





CP 
ae 
co 


Ic) In. lov\ SlOp 


a 
2005 | ao, 


TABLE 8-3. ASSUMED SET-UP TIME.* 
TS 


SIO, QIO, 


S10p, QO, 






30 
ee 
so, | 

16 


SIOp, SIO, 
QIO,, Alo, 


S105, SIO, 
QIO,, AIO, 


lOVR, IZ, IN, IC 


*The actual set-up times where not available at the time this was written. 
See current data sheets for correct timing on these signals. 
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TABLE 8-4. 


Am2901A — (MAY 18, 1978) 


ROOM TEMPERATURE 

SWITCHING CHARACTERISTICS 

Tables |, Il, and Ill below define the timing characteristics of 
the Am2901A at 25°C. The tables are divided into three types 
of parameters; clock characteristics, combinational delays 
from inputs to outputs, and set-up and hold time require- 
ments. The latter table defines the time prior to the end of the 
cycle (i.e., clock LOW-to-HIGH transition) that each input must 
be stable to guarantee that the correct data is written into one 
of the internal registers. 


All values are at 25°C and 5.0V. Measurements are made at 
1.5V with V;_ = OV and Vi, = 3.0V. For three-state disable 
tests, C, = 5.0pF and measurement is to 0.5V change on 
output voltage level. All outputs fully loaded. 


TABLE | 
CYCLE TIME AND CLOCK CHARACTERISTICS 


TIME TYPICAL GUARANTEED 
Read-Modify-Write Cycle 
(time from selection of 
A, B registers to end of 93ns 
cycle) 
Shift OQ Register (50% duty 40MHz 20MHz 
cycle) 















Maximum Clock Frequency to 






TABLE II 
COMBINATIONAL PROPAGATION DELAYS (all in ns, CL = 50pF (except output disable tests)) 


TYPICAL 25°C, 5.0V | GUARANTEED 25°C, 5.0V 


Shift 
Y 


D (arithmetic mode) 
D (1 = X37) (Note 5) 






~s 

falas 
~ 
Ol 


ie) 
i<@) 


a) 
ol} w |Olnm 61 | ho] 
A) 
on No} G | oo 
N Si BRIN 





QW} OO [Ro] W | 
aja! o;yOo);o 








A bypassing 
ALU (I = 2xx) 


[clock # (Note 6 40 


GQ | wl 

on;o oO;1on 

NO} NM 5 
aS Ww 








TABLE III 
SET-UP AND HOLD TIMES (all in ns) (Note 1) 


GUARANTEED 25°C, 5.0V 


a 










A,B 2,4 
Source 3,5 


towL + 15 
D (arithmetic mode) 


eee 
Eee 
PD W= x37) (Note S)[ P25 
eee 
teeeoecsl 
a 
idea’ 
Rae 





a eee 





RAMo, 3, Qo, 3 


Notes: 1. See next page. Z 
2. If the B address is used as a source operand, allow for the ‘‘A, B source’ set-up time, if it is used only for the destination address, use the 
““B dest.’’ set-up time. 
3. Where two numbers are shown, both must be met. 
4. ‘“‘towl” is the clock LOW time 
5. DVO is the fastest way to load the RAM from the D inputs. This function 1s obtained with | = 337 
6. Using O register as source operand in arithmetic mode. Clock is not normally in critical speed path when Q its not a source. 


TABLE 8-5. 


A. Am2903 SWITCHING CHARACTERISTICS (TYPICAL ROOM TEMPERATURE PERFORMANCE) — (MAY 18, 1978) 


Tables IA, HA, and IIIA define the nominal timing characteris- 7 
tics of the Am2903 at 25°C and 5.0V. The Tables divide the TABLE IA — Write Pulse and Clock Characteristics 


parameters into three types: pulse characteristics for the 
ime 


clock and write enable, combinational delays from input to ee eee 
Minimum Time CP and WE both LOW 
to write 


output, and set-up and hold times relative to the clock and 
write pulse 
Minimum Clock LOW Time 
Minimum Clock HIGH Time 


TABLE IIA — Combinational Propagation Delays (All in ns) 
Outputs Fully Loaded. CL = 50pF (except output disable tests) 


S105 
Ca+4 WRITE | Qi0,, alo, (Parity) 
om nt B Addresses 
(Arith. Mode) 
A, B Addresses 
(Logic Mode) 





Measurements are made at 1.5V with V,, = OV and Vi, = 
3.0V For three-state disable tests, C, = 5.0pF and mea- 
surement is to 0 5V change on output voltage level. 



















Cee 
Ces ee eee oo a 
ae 
[Sean Fe a I eA 
OEB Enable/Disable ne ee sen eee eMC 
OEY Enable/Disable [ra] - | - | - | - | - | - | - | - [- | - | - | 


fee ee Se ee ee ee 
Clock 




































Notes 1 Applies only when leaving special functions 
2 Enable/Disable Enable is defined as output active and correct. Disable is a three-state output turning off 
3 For delay from any input to Z, use input to Y plus Y to Z. 


TABLE I1IA — Set-Up and Hold Times (All in ns) 
CAUTION: READ NOTES TO TABLE lil. NA = Note Applicable; no timing constraint. 


LOW-to-HIGH 
Hold 


Y To store Y in RAM or Q 
‘WE HIGH Clock 


To Prevent Writing 


ae To Write into RAM 
Clock and WE both LOW re 


To Write Data only into 
the Correct B Address 


To Shift Q 


IEN HIGH as To Prevent Writing into Q 
ENLOW [<5 [Towitemoa | 






' HIGH-to-LOW 






With Respect to 
to this Signal 


























B as a Destination 
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Figure 2-1. 
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Figure 2-2. 
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Figure 2-3. 
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Figure 2-4. 
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Figure 2-5. 
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Figure 3-1. 
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Figure 3-2. 
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Figure 3-3. 
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Figure 3-4. 


DATA 
REGISTER 


DA,DB A, ByI 
CLOCK CLOCK = 


INSTRUCTIONS 


MAGNITUDE ONLY ARITHMETIC OPERATION 
WITH SHIFT DOWN SPEED COMPUTATIONS 


| DEVICE NO. | DEVICE PATH PATH1 | PATH2 
9 9 


CP to Q DATA OUT 4 oF ‘ seus ; 


A,B to G,P 
GP to Cn+xyz REGISTER Am2902A 


Cn to Cn+4 


IC to SIO, 
SIO to Y 


YtoZ 
SET-UP | 


SET-UP Y PATH 1) (Sieeete 


rors PATH 20 wns ocnnmne wnnene mes 








Figure 3-5. 


DATA BUS 





INSTRUCTION 
REGISTER 





OP CODE 
REGISTE 
—_ 3 5 












| 


DATA OUT 
REGISTER 


DATA IN 


SOURCE/DESTINATION 
REGISTER 
aememeeenencee _| REGISTER 


Pes 









DECODER 












CONDITION 
CODE 
MUX 











CONSTANT 
PROM 


M 
aE MICROPROGRAM 
CONTROLLER 


= 1 
PL Am2910 








FUTURE 
EXPANSION 






MICROPROGRAM MEMORY 















29775 
PROMS 


27813 
PROM’S 


PIPELINE 
REGISTER 


REGISTERS 


CONTROL 
BITS 


ADDRESS 
REGISTER 










SUPER SLICES 


SHIFT 
CONTROL 








CONTROL 
BITS 






PIPELINE 
REGISTER 


Figure 4a. 





ADDRESS BUS 


evi 


144 


DATA BUS 


DATA OUT 
REGISTER 


COMPUTER CONTROL 
UNIT (CCU) 


ADDRESS 
REGISTER 
ADDRESS BUS 


MICROPROGRAM 
MEMORY Am2904 
STATUS AND SHIFT 
CONTROL UNIT 
Y 
SUPER SLICES 
REGISTERED 
PROM 


S$103/SI03 


REGISTER 
lo-ly9/ CONTROL 


VO CONTROL 





Figure 4b. 


TABLE 9. TIMING ANALYSIS SUMMARY (ns). Bits PL14 The CE and SE control inputs of the Am2904, 

and PL15 respectively. CEu enables the Micro Status 

Am2901A Am2903 Register. SE enables the Am2904 shift opera- 
Bits PL16 CCU Next Address. 


through PL19 


Bits PL20 CCU Multiplex test select. 
Two’s Complement through PL23 
Arithmetic with 113 ; ; ; 
Shift Down Bit PL24 This bit determines the polarity of the incoming. 
test signal to the CCU. 
109 Bit PL25 Active LOW Instruction Register enabie. 


Bits PL26 CCU multi-way branching select. 


Magnitude only 
Arithmetic with 142 
Shift Down 
through PL29 


THE MICROPROGRAM STRUCTURE Bits PL30 Selects the ALU operand sources. 
through PL32 





The functions of the pipelined (PL) microprogram bits are il- 
lustrated in Figure 5 and as follows: 


PL32 | ALU Operand R | ALU Operand S 


Bits. PLO - This is a shared control field. The field is used 





















through PL11 for branching to a microprogram address or to L L L RAM Output A RAM Output B 
load the CCU counter or control bits for I/O. Pee H RAM Output A DBo.3 

Bit PkL12 The shared control field is determined by L H x RAM Output A Q Register 
PL12, LOW for branching and counting or H L L DAo-3 RAM Output B 
HIGH for I/O control. H IL H DAo-3 DBo.3 

Bit PL13 When LOW, enables the WRITE output and H | H X DAg.3 Q Register 





allows the Q Register and Sign Compare flip- 
flop to be written into. L = LOW H = HIGH X = Don't Care 





Bits PL33 
through PL36 
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Selects the ALU functions. 
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F; = Rj NAND §; 

F; = Rj ORS; 
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Bits PL37 Selects the ALU destination controls. 
through 40 
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This 4-bit wide field is used for the A-address 
source. 


f 


This 4-bit wide field is used for the B-address 
source. 


This 4-bit wide field is the B destination ad- 
dress into which new data is written. 


Am2903 control input OEy. When LOW enables 
the ALU shifter output data onto the Y bus. 


Am2904 instruction code field. 


Am2904 shift linkage multiplexer instruction 
code field. 


Am2904 “carry-in” control multiplexer field. 


The CEy, OEcr, OEy control inputs of the 
Am2904, respectively. 


This bit when LOW, enables bits PL74 through 
PL89 onto the Am2903 DA Bus. 


When LOW, zeros the carry in’s to the Am2903 
slices. 


When HIGH, enables a status register used in 
BCD calculations. 


When LOW, clears the status register. 


When LOW, enables Am2909/11 registers. 


This field contains a 16-bit constant from mi- 
crocode that is passed to the Am2903’s via 
the DA bus. Constant is enabled by PL69. 
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The Am2903 special functions can be selected by the following conditions: lo = |y = lo = Ig = Ig = LOW, TEN = LOW 
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Figure 5. 


SOME SAMPLE MICROROUTINES 


The following algorithms are implemented using the Am2903 
Superslices™ and Am2904 status and shift control unit. The 
algorithms were developed with the aid of AMDASM on System 
29. All algorithms assume values and constants to be initialized 
prior to the entrance of the algorithms. Appendix A relates the 
actual microcode to the microword fields. Appendix B is the 
AMDASM Phase 1 and Phase 2 listings of the microprograms 
and the definitions of mnemonics. Figure 4b is a block diagram 
of the CPU hardware including the Am2904 Status and Shift 
Control Unit from which the microroutines were developed. A 
detailed diagram of the CPU hardware is in Appendix C. 


Normalization, Singie- and Double-Length 


Normalization is used as a means of referencing a number to a 
fixed radix point. Normalization strips out all leading sign bits such 
that the two bits immediately adjacent to the radix point are of 
opposite polarity. 


Normalization is commonly used in such operations as fixed-to- 
floating point conversion and division. The Am2903 provides for 
normalization by using the Single-Length and Double-Length 
Normalize commands. Figure 6a represents the Q Register of a 
16-bit processor which contains a positive number. When the 


~ Single-Length Normalize command is applied, each positive 


edge of the clock will cause the bits to shift toward the most 
significant bit (bit 15) of the Q Register. Zeros are shifted in via the 
QIOO port. When the bits on either side of the radix point (bits 14 
and 15) are of opposite value, the number is considered to be 
normalized as shown in Figure 6b. The event of normalization is 
externally indicated by a HIGH level on the Cn+4 pin of the most 
significant slice (Cn+4 MSS = Q3 MSS ¥ Q2 MSS). | 


There are also provisions made for a normalization indication via 
the OVR pin one microcycle before the same indication is avail- 
able on the Cn+4 pin (OVR = Q2 MSS*¥ Q1 MSS). This is for use 
in applications that require a stage of register buffering of the 
normalization indication. 

Since a number comprised of all zeros is not considered for 
normalization, the Am2903 indicates when wuch a condition 
arises. If the Q Register is zero and the Single-Length Normaliza- 
tion command is given, a HIGH level will be present on the Z line. 


15914 13 12]11 10 9 8 
Q REGISTER 


LSS 
DEVICE 4 DEVICE 3 DEVICE 2 DEVICE 1 


a) Unnormalized Positive Number. 


15914 13 12/11 10 9 8 1 #0 


MSS LSS 


b) Normalized Positive Number. 


MPR-040 





Figure 6. 


The sign output, N, indicates the sign of the number stored in the 
Q register, Q3 MSS. An unnormalized negative number (Figure 
7a) is normalized in the same manner as a positive number. The 
results of single-length normalization are shown in Figure 7b. The 
device interconnection for single-length normalization is outlined 
in Figure 8. During single-length normalization, the number of 
shifts performed to achieve normalization can be counted and 
stored in one of the working registers. This can be achieved by 
forcing a HIGH at the Cn input of the least significant slice, since 
during this special function the ALU performs the function [B] + 
Cn and the result is stored in B. Figure 9 illustrates the single- 
length normalize. However, the microcode is shown in Figure 10. 
Microcode for both single and double normalization can be re- 
duced by one step by testing for zero during passing of number 
into Q. 


Normalizing a double-length word can be done with the Double- 
Length Normalize command which assumes that a user-selected 


Q REGISTER 


DEVICE 4 DEVICE 3 DEVICE 2 DEVICE 1 


a) Unnormalized Negative Single Length Number. 
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RAM Register contains the most significant portion of the word to 
be normalized while the Q Register holds the least significant half 
(Figure 11.) The device interconnection for double-length nor- - 
malization is shown in Figure 12. The Cn+4, OVR, N, and Z 
outputs of the most significant slice perform the same functions in 
double-length normalization as they did in single-length normaili- 
zation except that Cn+4, OVR, and N are derived from the output 
of the ALU of the most significant slice in the case of double- 
length normalization, instead of the Q Register of the most sig- 
nificant slice as in single-length normalization. A high-level Z line 
in double-length normalization reveals that the outputs of the ALU 
and Q Register are both zero, hence indicating that the doubie- 
length word is zero. 


When double-length normalization is being performed, shift 
counting is done either with an extra microcycle or with an exter- 
nal counter. Figure 13 illustrates the double-length normalize 
flowchart and Figure 14 shows the microcode. 


Q REGISTER 


b) Normalized Negative Single Length Number. 
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Figure 7. 
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Figure 8. Single Length Normalize. 


Unsigned Multiply 


This Special Function allows for easy implementation of unsigned 
multiplication. Figure 15 is the unsigned multiply flow chart. The 
algorithm requires that initially the RAM word addressed by Ad- 
dress port B be zero, that the multiplier be in the Q Register, and 
that the multiplicand be in the register addressed by Address port 
A. The initial conditions for the execution of the algorithm are 
that: 1) register R; be reset to zero; 2) the multiplicand be in Ro 
and 3) the multiplier be in Ry5. The first operation transfers the 


multiplier, R15, to the Q Register. The Unsigned Multiply instruc- 
tion is then executed 16 times. During the Unsigned Multiply 
instruction, R1 is addressed by RAM address port B and the 
multiplicand is addressed by RAM address port A. 


When the unsigned Multiply command is given, the Z pin of 
device 1 becomes an output while the Z pins of the remaining 
devices are specified as inputs as shown in Figure 18. The Z 
output of device 1 is the same state as the least significant bit of 
the multiplier in the Q Register. The Z output of device 1 informs 
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Figure 9. Single Length Normalize. 
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Figure 10. Figure 11. Double Length Word. 
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Figure 12. Double Length Normalize. 
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Figure 13. Double Length Normalize. 


DLN R15,R15,OFF & CONT & SHOLD 

MAZ & T & CUP & GOTO ABORT 

LOW R2 & MAC & T & CUP & GOTO END2 

DLN R15,R15 & SDUL & MAO & T & CUP & GOTO JUMP1 


DLN R15,R15 & SDUL & MIO & T & CUP & GOTO JUMP1 
PAR R2,R2 & JP ONE & GOTO LOOP4 

PAR R2,R2 & CONT ONE 

SDRQ R15, R15 & SDMS & END 





Figure 14. 
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Figure 15. Unsigned 16 X 16 Multiply. 


the ALUs of ail the slices, via their Z pins, to add the partial product 
(referenced by the B address port) to the multiplicand (referenced 
by the A address port) if Z = 1. If Z = 0, the output of the ALU is 
simply the partial product (referenced by the B address port). 
Since Cn is held LOW, it is not a factor in the computation. Each 
positive-going edge of the clock will internally shift the ALU out- 
puts toward the least significant bit and simultaneously store the 
shifted results in the register selected by the B address port, thus 
becoming the new partial sum. During the down shifting process, 
the Cn+4 generated in device 4 is internally shifted into the Y3 
position of device 4. At this time, one bit of the multiplier will 
down shift out of the QIO, ports of each device into the QIO3 
port of the next less significant slice. The partial product ts 
shifted down between chips tn a like manner, between the SIOo 
and SIO3 ports, with SlIOp of device 1 being connected to QIO3 
of device 4 for purposes of constructing a 32-bit long register to 
hold the 32-bit product. Shifting of the partial product between 
the B address and Q registers are accomplished via the 
Am2904. At the finish of the 16 x 16 multiply, the most signifi- 
cant 16 bits of the product will be found in the register refer- 
enced by the B address lines while the least significant 16 bits 
are stored in the Q Register. Using a typical Computer Control 
Unit (CCU), as shown in Appendix C, the unsigned multiply 
operation requires only two lines of microcode, as shown in 
Figure 16, and ts executed in 17 microcycles. 


LOPT R15 & F & GRD & PUSH & COUNT O0E 


UMUL R1,R1,R0 & F & CNT & SDDL & RFCT 





Figure 16. 
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Two’s Complement Multiplication 
START 


The algorithm for two’s complement multiplication is illustrated by MULTIPLICAND IN RO 
Figure 17. The initial conditions for two’s complement multiplica- MULTIPLIER IN. R15 

tion are the same as for the unsigned multiply operation. The 
Two’s Complement Multiply Command is applied for 15 clock 
cycles in the case of a 16 x 16 multiply. During the down shifting ieogtorn 
process the term N  OVR generated in device 4 is internally Ris“ Q 
shifted into the Y3 position of device 4. The data flow shown in 
Figure 18ais still valid. After 15 cycles, the sign bit of the multiplier 


is present at the Z output of device 1. At this time, the user must ae oumcue 


place the Two’s Complement Multiply Last cycle command on the MULTIPLY 
DECREMENT CTR 


instruction lines. The interconnection for this instruction is shown 
in Figure 18b. On the next positive edge of the clock, the Am2903 
will adjust the partial product, if the sign of the multiplier is nega- 
tive, by subtracting out the two’s complement representation of 
the multiplicand. If the sign bit is positive, the partial product is not 
adjusted. At this point, two’s complement multiplication is com- 
pleted. Using a typical CCU, as shown in Appendix C, the two’s 
complement multiply operation requires only three lines of micro- 
code, as shown in Figure 19, and is executed in 17 microcycles. 


TWO’S COMPLEMENT DIVISION 
END 


The division process is accomplished using a four quadrant non- PRODUCT (MS) IN R1 
restoring algorithm which yields an algebraically correct answer eye 

such that the divisor times the quotient plus the remainder equals 
the dividend. The algorithm works for both single precision and Figure 17. 2’s Complement 16 X 16 Multiply. 
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Figure 18. 





LQPT R15 & F & GRD & PUSH & COUNT 00D 


TCM R1,R1,RO & F & CNT & SDDL & RFCT 
TCMC R1,R1,RO & SDDL & CONT CZ 





Figure 19. 


multi-precision divide operations. The only condition that needs 
to be met is that the absolute magnitude of the divisor be greater 
than the absolute magnitude of the dividend. For multi-precision 
divide operations the least significant bit of the dividend is trun- 
cated. This is necessary if the answer is to be algebraically 
correct. Bias correction is automatically provided by forcing the 
least significant bit of the quotient to a one, yet an algebraically 
correct answer is still maintained. Once the algorithm is com- 
pleted, the answer may be modified to meet the user’s format 
requirements, such as rounding off or converting the remainder 


START 
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SIGN/MAGNITUDE 
TWO’S COMPLEMENT 
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SIGN/MAGNITUDE 
TWO’S COMPLEMENT 
R3 


2R3—*—R3 


SCALE 
DIVIDEND 


so that its sign is the same as the dividend. These format modifi- 
cations are accomplished using the standard Am2903 instruc- 
tions. 


The true value of the remainder is equal to the value stored in the 
working register times 2"—' when n is the number of quotient 
digits. 


The following paragraphs describe a double precision divide 
operator. 


Referring to the flow chart outlined in Figure 20, we begin the 
algorithm with the assumption that the divisor is contained in 
Ro, while the most significant and least significant halves of 
the dividend reside in R; and Ry, respectively. The first step is 
to duplicate the divisor by copying the contents of Ro into R3. 
Next the most significant half of the dividend is copied by 
transferring the contents of R; into Ro while simultaneously 
checking to ascertain if the divisor (Ro) is zero. If the divisor is 
zero then division is aborted. If the divisor is not zero, the 
copy of the most significant half of the dividend in Rog is con- 
verted from its two’s complement to its sign magnitude rep- 
resentation. The divisor in R3 is converted in like manner in 
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Figure 20. Two’s Complement Division. 
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the next step, while testing to see if the results of the dividend 
conversion yielded an indication on the overflow pin of the 
Am2903. If the output of the overflow pin is ‘one’ then the 
dividend is —2" and hence is the largest possible number, 
meaning that it cannot be less than the divisor. What must be 
done in this case is to scale the dividend by down shifting the 
upper and lower halves stored in R, and R, respectively. After 
scaling, the routine requires that the algorithm be reinitiated at 
the beginning. 


Conversely, if the output of the overflow pin is not a one, the sign 
magnitude representation of the divisor (R3) is shifted up in the 
Am2903, removing the sign while at the same time testing the 
results of two’s complement to sign magnitude conversion of the 
divisor in the Am2910. If the results of the test indicate that the 
divisor is —2" i.e., overflow equals one, then the lower half of the 
dividend is placed in the Q register and division may proceed. 
This is possible because the divisor is now guaranteed to be 
greater than the dividend. If overflow is not a one then we must 
proceed by shifting out the sign of the sign magnitude represen- 
tation of the dividend stored in Ro. At this point we are able to 
check if the divisor is greater than the dividend by subtracting the 
absolute value of the divisor (R3) from the absolute value of the 
upper half of the dividend (Rg) and storing the results in R3. Next, 
the least significant half of the dividend is transferred from R, to 
the Q register while simultaneously testing the carry from the 
result of the divisor/dividend subtraction. If the carry (Cn+4) is 


F= [B] +Cn, Log 2F-Y,B 
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one, indicating the divisor is not greater than the dividend then a 
scaling operation must occur. This involves either shifting up the 
divisor or shifting down the dividend. If the carry is not one then 
the divisor is greater than the dividend and division may now 
begin. 


The first divide operation is used to ascertain the sign bit of the 
quotient. The two’s complement divide instruction is then exe- 
cuted repetitively, fourteen times in the case of a sixteen bit 
divisor and a thirty-two bit dividend. The final step is the two’s 
complement correction command which adjusts the quotient by 
allowing the least significant bit of the quotient to be set to one. At 
the end of the division algorithm the sixteen bit quotient is found in 
the Q register while the remainder now replaces the most sig- 
nificant half of the dividend in R,. It should be noted that the 
remainder must be shifted down fifteen places to represent its 
true value. The interconnections for these instructions are shown 
in Figures 21, 22, 23. Using a typical CCU as shown in Appendix 
C, the double precision divide operation microcode, is shown in 
Figure 24. 


For those applications that require truncation instead of bias 
correction, the same algorithm as above should be implemented 
except one additional Two’s Complement Divide instruction 
should be used in lieu of the Two’s Complement Divide Correc- 
tion and Remainder instruction. However, this technique results 
in an invalid remainder. 
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Figure 21. Double Length Normalize/First Divide Operation. 
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Figure 22. 2’s Compiement Divide. 
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F=(B) + [A] +CaifZ=0 
F=[B}] — [A] -1+CyifZ=1 


Ql03 QI09 


Ch+4 

OVR Am2903 Am2903 Am2903 Am2903 Cy 

N 

S103 S109 
z z z Z 


SIGN COMPARE FF 





MPR-055 


Figure 23. 2’s Complement Divide Correction. 


LOW R10 & JSR & GOTO INP 
PAR R7,R15 & JSR & GOTO INP 
PAR R1,R15, & JSR & GOTO INP 
PAR R4,R15 & CONT 
PAR R3,R7 & CONT 
PAR R2,R1 & T & MIZ & CJP & GOTO ABORT 
SMTC R2,R2 & CONT Z 
SMTC R3,R3 & T & MIO & CUP CZ & GOTO SCALE1 
ALUOFF & T & MIO & CJP & GOTO SKIP6 
SURL R3,R3 & SUL & CONT 
SURL R2,R2 & SUL & CONT 
ALUOFF & JP & GOTO LOOP2 
SCALE LQPT R4 & JSR & GOTO SDIVD 
ALUOFF & JP LOOP1 
LOOP2 SSR R3,R2,YBUS & CONT ONE 
SKIP6 LQPT R4 & F & MIC & CUP & GOTO SKIP3 
ALUOFF & JSR & GOTO SDIVD 
SURL R2,R2 & SDL & CONT 
ALUOFF & JP & GOTO LOOP2 
ALUOFF & F & GRD & LDCT & COUNT 00C 
DLN R1,R1,R7 & T & GRD & SDUL & PUSH 
TDIV R1,R1,R7 & F & CNT-& SDUL & RFCT CZ 
TDC R1,R1,R7 & SUH & CONT CZ 
QMOV R15 & JSR & GOTO OUTP 
PAR R15,R1 & JSR & GOTO OUTP 
ALUOFF & JP & GOTO DIV 
PAR R1,R1 & CONT 
ALUOFF & T & MIS & CJP & GOTO NEG 
PAR R1,R1,ADRQ & SDDL & CONT 
ALUOFF & JP & GOTO RET 
PAR R1,R1,ADRQ & SDDL & CONT 
QMOV R4 & CONT 
PAR R10,R10 & RTN ONE 


START 
NUMBER TO BE ROOTED IN R11 


RO— R1 — R5 
SHOLD 


R5——R5 “A R4 


YES 
R5—— R5 Vv R2 





Figure 24. 


NON-RESTORING BINARY ROOTS 


The algorithm for Non-Restoring Binary Roots is illustrated in a YES RO—=— RO-R5 
Figure 25. The initial conditions required are: 1) the non-negative mae a 
number to be rooted in the radicand register, Rj; 2) Re has the 

positive append bits 101,; 3) R3 has the negative append bits 

011; 4) Ry is the mask register with BFFF,;; 5) Re is the partial 

register with 4000,,; and 6) the counter register, Re, with the sta 

value 084. 

An example of the Non-Restoring Binary Root algorithm is shown SE oarco MeL a RE 


in Figure 26. Starting at the binary point, the number to be rooted 
is partitioned into pairs. The partial value is subtracted from the 
first pair. An intermediate remainder and sign are then produced. Figure 25. Non-Restoring Binary Root. 





154 


ROOT ————___—__> 1 
874 
13549 


1 
ist | 0 
PARTIAL VALUE 1 
0 
1 
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1. 
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4731 
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L_ 
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REMAINDER 


L 
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Figure 26. Non-Restoring Binary Root Example. 


If the remainder is positive, a 1 is entered in the corresponding 
root bit. Then a 01 is appended to the partial, shifted and sub- 
tracted from the present remainder to produce the next remain- 
der. When the remainder becomes negative, the present remain- 
der is not restored. A 0 is entered in the next corresponding root 
bit. Then an 11 is appended to the partial, shifted and added to the 
present remainder. The entire process is repeated until the 
partial root has developed into 8 bits or the remainder is zero. 


Referring to Figure 26, the same method of finding the root 
applies. A starting partial value, Rs, is subtracted from the 
radicand, R;, which produces the intermediate remainder Ro. 
During this time, the sign of the remainder is stored within the 
Am2904. Then Rs is masked by R, to obtain the next partial value 
and R, is shifted to obtain a new mask for the next cycle. Status is 
obtained from the Am2904 and tested. If the remainder is posi- 
tive, a root bit of 1 is developed and bits 01 appended by Ro. 
When negative, a root bit of 0 is developed and bits 11 appended 
by R3. At this point Rg is decremented and tested for zero. If Rg # 
0, then addition or subtraction is performed on the remainder 
depending on the sign bit stored in the Am2904. A new remainder 
is produced and cycled through the procedure again. Figure 27 
illustrates the microcode. 


BCD HARDWARE ADDITIONS 


In applications where fast BCD operations are needed the de- 
signer has the option of using a slight amount of additional 
hardware to dramatically increase the performance of these op- 
erations. These firmware/hardware trade-offs are very applica- 
tion sensitive. The hardware-firmware examples given below are 
specifically for an intensive BCD system with a large fraction of 
conventional logic-arithmetic operations. The designer is willing 
to reduce cycle time slightly to increase BCD thru-put. Small 
hardware additions are acceptable as long as flexibility is re- 
tained. 


LOW R10 & CONT 

LOW RO & CONT 

PAR R1,R15 & CONT 

PAR R2,R0,,DARB & CONST 0005 & CONT 
PAR R3,R0,,DARB & CONST 0003 & CONT 
PAR R4,R0,,DARB & CONST H#BFFF & CONT 
PAR R4,R0,,DARB & CONST 4000 & CONT 
PAR R6,RO,,DARB & CONST 0008 & CONT 
SRS RO,R1,R5 & CONT & SHOLD 

AND R5,R5,R4 & CONT 

SDRL R4,R4 & MAS & CJP & GOTO END3 
SDRL RO,RO, & T & MAS & CUP & GOTO POS 
OR R5,R3 & JP & GOTO CNT 

OR R5,R2 & CONT 

SRS R6,R6,RIO & CONT 

SDRL R2,R2, & T & MIZ & CUP & GOTO END3 
SDRL R3,R3 & T & MAS & CUP & GOTO SUB 
ADD RO,RO,R5 & JP & GOTO CYCLE & SHOLD 
SRS RO,RO,R5 & JP & GOTO CYCLE & SHOLD 
JP & GOTO SQRT 





Figure 27. 


The hardware additions finally decided on were chosen to in- 
crease the performance of BCD to binary conversion, binary to 
BCD conversion and BCD addition. The performance increases 
were approximately an order of magnitude in the first two cases, 
and a factor of 4 or 5 in the last case. A diagram of the additions 
(3% ICs) is given in Figure 28. 


The 74S08 AND gates normally pass the carry from the 
Am2902A to the Am2903s. When microbit CZER is low the 
Carries-in are forced to zero. This is used to “disconnect” the 
carry so that atest may be done on each slice simultaneously. For 
example if a test for 5 or greater is desired a HEX B is added and 








BCD HARDWARE ADDITIONS 


Am2904 


CIN 


CLOCK O 


TO Am2909 OR INPUTS 





C GP C G,P 


Am2902A 


Figure 28. 


the carry out of each slice will indicate the result of the test. This 
allows simultaneous tests on each individual slice and greatly 
increases thru-put. This addition increases the performance of 
BCD to binary conversion and binary to BCD conversion by at 
least an order of magnitude. The drawback to this addition is that 
the AND-gates introduce an extra gate delay in a critical path. The 
machine cycle time may be increased by about 8ns. The increase 
in BCD performance will more than offset this delay for BCD 
intensive systems. 


Another hardware addition is the Am25LS241 three-state buffer. 
This buffer allows the Am2904 to be used to store the carry-out 
status bits via the bi-directional Y bus. 


The 25LS195A is wired as a 4-bit register with clear and enable. 
This register is used to store the carry-out bits from a test cycle. 
The outputs of the 25LS195A are ORed with the output of the 
Am2904 Y-bus and connected to the Am2909 OR inputs in the 
CCU. This allows a multi-way branch on the OR of two test 
cycles, greatly increasing the performance of BCD addition. 


BCD TO BINARY CONVERSION 


The usual method of BCD to binary conversion is to divide the 
BCD number by 2. The 1-bit remainder will indicate if a 1 
existed in the BCD number. The previous division result is di- 
vided by 2 again and the remainder will indicate if a 2 existed 
in the BCD number. In general the remainder from a division 
by 2" will indicate if a 2"~' existed in the BCD number. 


These remainders can be used to construct the binary rep- 
resentation, b,2" + by_y2 1 + bp_o2™? +... 4+ by2? + 
bp2° The b, bit is thus the remainder from division step n + 
1. The binary representation may thus be created by shifting 
the remainders down until the m-bit BCD number has been 
divided by 2 m times. 


To divide a BCD number by 2 a down shift is executed. The 4, 2 
and 1-bit positions will contain the correct result, but the 8-bit 
position is incorrect. Its value has changed from 10 to 8 instead of 
from 10 to 5. This means the resulting BCD number will have a 
value 3 greater than it should for the division by 2 to be correct. A3 
must be subtracted from any digit in which a 1 entered its 8-bit. 


A sample conversion is given in Table 10. The BCD number is 
gradually shifted down and corrected when necessary. The bi- 
nary number is finally correct after 16 cycles. 


A flow diagram for the algorithm is given in Figure 29. The BCD 
input, A, is shifted down into the binary output B, to start the loop. 
The constant 0888 is added to A with the carries-in forced to zero. 
The resulting carries-out will indicate if A contained a 1 in any of 
the 8-bit positions. These carries are saved in status register 
SR1. A multi-way branch is then executed to enter the adjust 
table. The digits are adjusted depending on the previous test. At 
the same time a shift can be executed to prepare for the next test 
instruction. A test for end of loop is also done in this cycle to 
provide an exit if 16 iterations of the loop are complete. Finally a 
shift up of B is needed to cancel the extra right shift when the loop 
is exited. The microcode for this algorithm is given in Figure 30. 
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TABLE 10. 


BCD —~ Binary 


Result 


1000 


11000 


011000 


1011000 


01011000 


101011000 


SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 


Operation 


DIGIT 1 


DIGITS 2, 0 


DIGIT 1 
DIGIT 1 
DIGIT 1 
DIGIT O 
NONE 
NONE 


DIGIT 0 





1101011000 SHIFT 


ADJUST NONE 


01101011000 SHIFT 


1011010 


ADJUST NONE 
11000 SHIFT 
ADJUST NONE 


0101101011000 SHIFT 


0010110 


ADJUST NONE 
1011000 SHIFT 
ADJUST NONE 


000101101011000 SHIFT 


ADJUST NONE 


0000101101011000 SHIFT 


A: = BCD NUMBER 
B. = BINARY RESULT 
D: = DUMMY REGISTER 


O—A + 0, 8, 8, 8, 
CARRIES-IN ~<<— 0; 
SR1~- CARRIES-OUT; 


ADJUST; 
SHIFT A DOWN INTO 
B; LOOP 15 TIMES; 


Figure 29. BCD to Binary Conversion (16 Bits to 14 Bits). 


ADJUST NONE 





BINARY TO BCD CONVERSION 


A method very similar to the one used for BCD to binary conver- 
sion may be used for binary to BCD conversion. The BCD number 
is created by shifting the binary number up, into a partial BCD 
result. The BCD number is adjusted to provide a multiplication by 
2. The shift adjust process continues until the least significant 
binary bit is shifted into the BCD result. 


The adjustment is needed when a 1 is shifted from the 8-bit 
position to the 1-bit position of the next digit. the value has 
increased from 8 to 10, instead of from 8 to 16. To correct this a 6 
must be added to any digit that has a 1 shifted out of its 8-bit 
position. Alternately a 3 could be added before the shift to any 
digit that has a 1 in its 8-bit position. 


Another correction is needed whenever an invalid BCD digit is 
encountered. If a number greater than 9 is detected in any digita 
10 must be subtracted from that digit and a 1 added to the next 
highest digit. The same correction can be accomplished if a 6 is 
added to the invalid digit after the shift. To correct before the shift 
a 3 is added to any digit which contains a 5, 6 or 7. These 
adjustments are summarized in Table 11. Both adjustments may 
be accomplished by adding a3 to any digit which is greater than 4. 


Table 12 shows an example conversion. The binary number is 
gradually shifted up and the BCD partial result adjusted. After 14 
iterations the conversion is complete. A flow diagram for the 
algorithm is given in Figure 31. 


ENR & COUNT LOOP & CONT 
PAS RO, RO LDRQ & SDDL & LDCT & CONST 15 

ADD R1, RO, RO, DARB & ALUOFF & CONST 0888 & CZERO & ENSUR1 & CLSR2 & RPCT 
ALUOFF & MULTI 8WAY 
ALIGN 8 

ALUOFF & CJRP & CNTR & GOTO EXIT 


SUB RO, RO, RO, LDRQ,DARB & CONST 0003 & CJURP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LDRQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LDRQ,DARB & CONST 0003 & CJURP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LDRQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LDRQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LDRQ,DARB & CONST 0003 & CJURP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LDRQ,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
PAS RO, RO, RO, LURQ & SDUL & RTN 





Figure 30. 


TABLE 11. 


Illegal BCD 


Shift Thru 
Correction 





Initially the 14-bit binary number is left justified by two shift up 
operations. To start the loop the binary input, B, is shifted up, into 
the partial BCD result, A. The constant BBBB is added to A, with 
the carries-in forced to zero. The resulting carries-out are stored 
In status register SR1. A multi-way branch is used to enter the 
adjust table. The digits are adjusted depending on the result of the 
previous test. In the same instruction a shift is executed to pre- 
pare for the next test cycle. Additionally an end of loop test is used 
to provide an exit if 16 iterations of the loop are complete. Before 
the exit a fix-up cycle is used to cancel the extra shift executed in 
the loop. The microcode for this algorithm is given in Figure 32. 


BCD ADD 


One method of performing a 4-digit BCD add is to do a 16-bit 
binary add, with the carries-in forced to zero, and adjust the 
resulting sum. The adjustments are necessary to change invalid 
BCD digits to valid BCD digits. When an invalid digit is modified 
a carry to the next highest digit is generated. This could cause a 


SHIFT UP B 


‘ 


SHIFT UP B 
SHIFT UP B 
INTO A 


D——A + 11, 11, 11, 11; 
CARRIES-IN—~ 0 
SR1—— CARRIES-OUT 


MULTI-WAY BRANCH 
ON SR1 


16 


ADJUST; 
SHIFT-UP B INTO A; 
LOOP 15 TIMES; 


SHIFT DOWN A 





Figure 31. Binary to BCD Conversion (14 Bits to 16 Bits). 
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Q. =Binary Input 
RO. = BCD Result 


SURL RO, RO & SUL & CONT 

SURL RO, RO, & SUL & ENR & COUNT LOOP & CONT 

PAS RO, RO, ,LURQ & SDUL & LDCT & COUNT 15 

ADD R1,R0, RO, DARB & ALUOFF & CONST BBBB & CZERO & ENSR1 & CLSR2 & RPCT 
ALUOFF & MULTI 16WAY 

ALIGN 16 

ALUOFF & CJRP & GOTO EXIT 

ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CJURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CJURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CJURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CJURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CJURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LUROQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CJURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURQ,DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 


SDRL RO, RO, & SDL & RTN 





Figure 32. Binary to BCD Conversion Microcode (14 Bits to 16 Bits). 


Result 


Digit 
2 


Digit 
1 


Digit 
0 


TABLE 12. 


Binary —~ BCD 
Conversion 


00101101011000 
0101101011000 


101101011000 
01101011000 
1101011000 
101011000 
01011000 
1011000 
011000 
11000 

1000 

000 

00 


0 


SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 
SHIFT 
ADJUST 


Operation 


NONE 
NONE 
NONE 
NONE 
DIGIT O 
NONE 


NONE 


DIGIT O 


DIGIT 1 


DIGIT 1 


DIGIT 1 


DIGIT 2 


DIGIT 1 


NONE 





previously valid digit to become invalid. The word must be 
checked and modified until all digits are valid (up to four modifi- 
cation cycles could be necessary). 


Initially the two BCD numbers are added with the carries-in to 
each digit forced to zero. The carries out are saved. Next the hex 
number 6666 is added to the sum, with the carries-in forced to 
zero, and the resulting carries out are saved. This tests each 
digit for validity, a carry-out indicating an invalid BCD digit 


A-~-A + B + CIN; 
CARRIES-IN—— 0; 
SR1-*— CARRIES-OUT; 


D——A + 6,6,6,6; 
CARRIES-IN——0; 
SR2—— CARRIES-OUT; 


MULTI-WAY 
BRANCH ON 
SR1 OR SR2 


ADJUST DIGITS; 
SR1i~—— 0; 


ADJUST ADJUST 


Figure 33. BCD Add. 





(greater than 9). If a carry was generated in either cycle a 6 is 
added to the invalid digit, with carries-in forced to zero, to create 
the valid BCD digit. Additionally a 1 must be added to the next 
highest digit to provide the BCD carry-out. Each time a digit is 
adjusted the carry-out may invalidate the next highest digit. Thus 
adjustment cycles must be followed by validity tests until all 
digits are valid. A flow diagram for this algorithm ts given in 
Figure 33. The microcode for this algorithm is given in Figure 34. 


A R1 
B RO 


1 ADD R1,R1,RO & CZERO & ENSR1 & CONT Z 

2 ADD R1,R1,R0,,DARB & ALUOFF & CZERO & ENSR2 & CONST 6666 

3 ALUOFF & MULTI 16WAY & RMAC 

ALIGN 16 

4 ALUOFF & JMP & GOTO EXIT & ENSR1 

5 ADD R1,R1,R0,,DARB & CONST 0016 & GOTO LOOP & CLRSR1 

6 ADD R1,R1,R0,,DARB & CONST 0160 & GOTO LOOP & CLRSR1 

7 ADD R1,R1,R0,,DARB & CONST 0176 & GOTO LOOP & CLRSR1 

8 ADD R1,R1,R0,,DARB & CONST 1600 & GOTO LOOP & CLRSR1 

9 ADD R1,R1,R0,,DARB & CONST 1616 & GOTO LOOP & CLRSR1 
10 ADD R1,R1,R0,,DARB & CONST 1760 & GOTO LOOP & CLRSR1 
11 ADD R1,R1,R0,,DARB & CONST 1776 & GOTO LOOP & CLRSR1 
12 ALUOFF & JMP & GOTO LOOP & SMAC & CLRSR1 
13 ADD R1,R1,R0,,DARB & CONST 0016 & GOTO LOOP & SMAC & CLRSR1 
14 ADD R1,R1,R0,,DARB & CONST 0160 & GOTO LOOP & SMAC & CLRSR1 
15 ADD R1,R1,R0,,DARB & CONST 0176 & GOTO LOOP & SMAC & CLRSR1 
16 ADD R1,R1,R0,,DARB & CONST 1600 & GOTO LOOP & SMAC & CLRSR1 
17 ADD R1,R1,R0,,DARB & CONST 1616 & GOTO LOOP & SMAC & CLRSR1 
18 ADD R1,R1,R0,,DARB & CONST 1760 & GOTO LOOP & SMAC & CLRSR1 
19 ADD R1,R1,R0,,DARB & CONST 1776 & GOTO LOOP & SMAC & CLRSR1 


EXIT 


Figure 34. BCD Add Microcode. 





SUMMARY 


In this chapter, a detailed description of the Am2904 was pre- 
sented, along with an example timing analysis. Several micro- 
code algorithms were:discussed to show how the Am2904 oper- 
ates in a 2903 based CPU. As can be seen, the Am2904 provides 
a powertul, single-chip LSI solution to the shift multiplexer, status 
register, and carry multiplexer design portion of a CPU using 
either the Am2901B or the Am2903. 


The Appendix includes a full microcode listing. The interested 
reader is encouraged to study these listings to gain a better 
understanding of the hardware organization (Appendix C). An 
additional microcode listing (Appendix B) gives the AMDASM™ 
definition file and source file for the microcode. The reader should 
study these listings while referring to the AMDASM Manual. (The 
Am2900 Family Data Book contains an AMDASM Reference 
Manual, document AM-PUBO003, 4-78 FRODO.) 
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APPENDIX A 


COMMENTS 


Am2904 CONTROL FIELD 


=z 
2) 
- 
Oo 
=~ 
io 
= 
2) 
2 





~- 
oO 
w 
oO 


ct af 
” 
od 
oO 
89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 7473 72 71 70/69/68167 166 65 64/63 62 61 60/59 58 57 56 55 54/53 





CONSTANT 


ADDRESS LABEL 


XX X X X X XX XX KX KX KX KX KX XIX KX KX XIXIXIXIX1O OFX KX KX XIX KX KX X XK XKiIO 


UNSIGN 
MULTIPLY 


O|1X X X X X XO 


1 


X XX X X KX X XX KX KX KX KX KK KIX KX KX KIXIXIXIX/O 010 1 


TWO'S 


COMPLEMENT 
MULTIPLY 


XX X X KX X XX KX KX KX KX KX KX XK KIX X KX KIXIXIX|X1O O1X X KX XI[X KX KX X XK KIO 


0 


O|X X X X X XI0 


1 
1 


XX XX X XX X KX KX KX KX KX KX K KIX KX KX KIXIXIXIX10 O10 1 
X XX X X KX KX X KX KX KX KX KX KX K KIX K KX KIXIXIX| xX) 1 


O|X X X X X X{0O 


0 1 


1 


XX XX XX X XX KX KX KX KX KX K KIX KX KX KIXIXIXIX)O O1X X X XIX X X X XK KIO 


COMPLEMENT 
DIVIDE 


TWO'S 


X X X|X)X1X1X}O O1X X X XIX X KX X XK KIO 


X X X X X XX X X KX KX KX K KX KX KIX KX KX KIXIXIXIX1O OX X X XIX KX KX KX XK KIO 
XX XK X KX X KX XX KX KK KKK KIX KX KX KIXIXIXIX}O O1|X X X KIX KX K X XK XIO 
XX X XX XX XX KX KX KX KKK KIX KX KX KIXIXIXIX}O OX X KX XIX KX KX X XK XKiIO 


XX X XX X KX KX KX KX KX K KX XK K XKILK 


0 01|0 


O1X X X X|X X X X X XjO 


O;X X X X10 


XX XK XXX KX KX KX KK KKK KIX KX KX KIXIXIO;X}/O OIX X X KIO 


XXX XXX XXX XXX KX XK XIX KX KX KIXIXIXIX!0 O10 O 


1 


0 


XX XX XX XXX XX XX KK XIX KX X KIXIXIO;X1O O;X xX X X10 1 


XX X XX X KX XX X KX KX KX KX KX KIX KX KX KXIXIXIXIX] 1 
XX X XX X X XX KX KX KX KX KX KX KIX KX K XKXIXIX1OlXs) 1 


0 

0 
O|X X X X X X10 
O|X X X X X X10 


1 
1 


1 
1 


X XX XX KX X KX KX KX KX KX K KK KIX KX KX KIXIXIXIX10 010 O 


XX XX KX X KX X KX KX KX KX KX KX K KIX KX KX KIXIXIXIX1O OX KX X XI|X K KX X XK KIX 
XX XXX XK XX KX XX KX KX KX KIX KX KX XKIXIXIXIX1O OX X KX XIX KX KX XK XK KIO 


SCALE 1 


5 


2 


X X X X|X KX X KX X XIX 
XX X X|X X X X X XIO 


1 
1 


X XX XX KX KX XX KX KX KK KK KIX KX KX KIXIXIXI XO 


XX X XX XX KX KX KX KX KX KX KX XK KIX KX KX KIXIXIX| XO 


LOOP 2 
SKIP 6 


7 


8 


2 
2 


1 


1 


X XX XX KX KX KX KX XK KX K KK KIX KX KX KIXIXLO;XIO O1xX X X XKIO 


1 


XX X XX KX KX XXX KX KX XK KK KIX KX KX KIXEXIXIXIO OFX KX KX XIX KX KX XK XK XIX 
XXX KX XXX XX XK XK KK KK KIX KX K KIXIXIX|XI0 0;0 0 O O]|X K XK X X XIN 
XX X XX XX KX KX XK KX KX KX KX KIX X KX KIXEXIXIXIO O;X xX X XIX X XK X XK XIX 
XX X KX KX KX X KX KX KX KX KX KX KK KIX KX KX KIXIXIXIXI1O O1X X KX XIX KX KX X XK XIX 


XX X KX X KX X KX KX KX KX KX KX KX K KIX KX K KIXIXIX|XIO O 
XX XX X X X X KX KX KX KX KX KX KX KIX K K KIXIXIX|X 


2 C | SKIP 3 


1 


0 


X X X X X XIO 
X X X X X X10 
X X X X X XO 


1 
1 
1 


1 
1 
1 


1 
1 


1 
1 


0 


1 
1 


0;0 O 


XX XX X X KX KX KX KX XK XK KK KIX K K XKXIXIXIX|X 


XX KX XX KX KX KX KX KX KX KX KX KK KIX KX KX KXIXMIXIXIX{O O1X KX X XIX XK X X X XIO 
XX XXX XX XXX XX KX KX XK KIX X KX XIXPXIXIXLO O1X KX X XIX KX XK XK XK XKIO 
X X X X X X XX KX KX KX K KX KX KX KIX KX KX KXIXIXIXIX;O O1X X K XIX KX X XK XK KIX 
XX KX XX XX KX KX KX KX KX K KX K KIX KX K KIXIXIX|X}O O;X KX KX XIX K K X XK XiIO 
XX X XX X KX X KX KX KX KX KX KX KX KIX KX KX KXIXEXIO;XIO OX xX X XIO 


XX XXX XXX XX KX KX KX KX K KIX KX KX XIXIXIXIXIO O10 


1 


1 


O|;X X X X X X10 


1 


1 


XXX XXX XX KX KX KX KX KX KX KX KIX KX X KXIXIXIXIX}O O1X X X XIX X X X X XIX 


XX X XX X KX X KX KX KX KX KX XK KX KIX X KX XIXIXIXIXIO O10 


O|X X X X X XI{0 


1 


1 


XX X KX KX XX KX KX KX KX KX KX KX KX KIX KX KX KXIXIXIXIX1O OX kX KX XIX KX X X X XIN 


XX XXX KX KX KX KX KX KK KK KIX K KX KXIXIX| XL Xo 


X X KX XjX X X X X XIO 


1 


E 
aece 
SH 
AR 
ED 
c 
ee 
FIE 
LD 


D 
EVICE 


cc 
U 
co 
N 
7 

FIE 

LD 


A 

m 
290 
3 ‘ 

co 
pene 
FI 
EL 

D 
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APPENDIX A 


Am2904 CONTROL FIELD 


COMMENTS 


2 
Q 
© 
a. 
- 
ze 
- 
” 
z 





CONSTANT 
89 88 87 86 85 84 83 82 81 BO 79 78 77 76 75 74173 72 71 70169168167166 165 64/63 62 61 60/59 58 57 56 55 54 


ADDRESS | LABEL 


SINGLE 
LENGTH 


XX XX XX XX XX KX KX KX KX KX KIX KX XK KIXIXIX|X{O O;X KX xX KX X KX XK XK XO 


1 


0 


NORMALIZE 


1 0 0;xX 


0 
0 
0 
0 


1 
1 
1 
1 


XX X X X X X X X X XX KX KX KX KIX KX KX KIXIXIO;XjO OFX XK XK XIO 


1 
1 


XX X X X XX XX KX KX XXX KX KIX KX XK XIXIXIO;X!0 O;X X X XIO 


X X XX X X XX XX XX KX XK KX KIX KX KX XIXIXjIO;X)O 1 


X X X XI0 


1 3 F 
1 


0 
0 


1 


X X X X|O 


X X X XX X X X X X KX KX KX KX KX KIX KX KX KIXIXIO;XIO 1 


4 0] AGAIN 


O|X X X X X Xj{0 


1 


XX X X X XX XX XXX KX KX XK KIX KX KX KIXIXIXIXjo0 0;0 O 


XX XX XX XX X X XX KX XK K KIX KX KX KIXIXIXIXIO O;X KX KX KIX K KX XK XK KIO 


DOUBLE 
LENGTH 


XX XX XX XX X XX KX XK XK XK KIX X K KXIXIXIXIXIO O1X X X XX X KX XK XK KIO 


XX KX XX X XX XX XX KX KX KX KIX KX KX KXIXIXIOXIO OX XK X KIO 


4 


1 


0 


NORMALIZE 


010 0;|X 
0 


1 
1 


1 


XX XX X X X XX XK KX KX KX KX KIX KX KX XKIXIXIO;XIO O;X X X XIO 


XX XX XX XXX XX KX KX KX X KIX K KX XIXIXIO XO 010 + 


1 0)0 
1 0/0 


0 


XX XX XX XX XX XX KX KX KX KIX KX KX KXIXIXIO;X}O 0;0 1 
X X X X X X XX KX X KX KX KX KX KIK KX XK KIXIXIX|X1d 1 


LOOP4 


X X X XIX X X X X XO 


X X X XIX X XX KX XK KIO 


X X X XK X X XX KX X KX XX KX KX KIX KX K KIXIXIXIX1d O10 


1 


X X X X X KX X X X KX KX KX KX KX KX KXIK KX K KIXIXIX;XIO 


4 E | JUMP1 


1 


X X X X X X10 


1 


0 


1 


BINARY 


ROOTS 


XX X X X X X KX X X KX KX KX KX KX KLIK KX K KIXIXIXIX{O oO; xX X KX XIX KX KX XK XK KIO 


XX XX X XX KX KX X KX KX KX KX KX KEK K K KIXIXLXIX1O O|X KX X XIX K KX XK XK XO 


XX XX X KX XX X XX KX KX KX X KXIK KK KIXIXIXIX;O O|X KX X XIX KX XK XK XK XO 


X X KX X1OlX{X|X10 O|X K KX XIX X KX XK XK KIO 


0 1 


0000000000000 1 


X X X X1O}XIX}X}O0 O1X X X XIX X KX XK XK KIO 


1 
1 


0000000000000 0 1 


X X X X1O0{X|IX|X10 O1X X X XIX X XK XK X XO 
000000000000 0 0}X XK X X}O;}XIXIX;O0 O1|X X X XIX K X XK XK XIN 


1 


1 


1 


0 


0 0 O;X X X XIOIX{XIX1O0 O|X X X XIX X KX XK XK KIO 


XX X XX X KX XX KX KX KX KX KX KX KIX KX XK XIXIXIXIX}O O1X KX X XIX X KX XK X KIO 


5 Bi CYCLE |X X X X X X X KX X KX KX KX KX KX KIX K WK KIXIXIX|X!|O OX KX X XIX KX KX XK XK KIO 


000000000000 1 


1 


0 


1 
1 


XX X X X X KX KX KX KX KX KX KX KX KX KEK KX KX KIXIX}O;X10 0; X X XK X 


XX X XX X KX X KX KX KX KX K K KX XIX KX KX KIXIXIOIX!O0 QixX xX XK xX 


XX X X X X KX X KX KX KX KX KX KX KX XIX KX KX KIXIXIXIXIO O|X X X XjX X XK X XK XO 


XX XX XX XX XX XXX xX X XIX X X KXLX(XIXIX1O O1X xX xX XIX X X X XK KIO 


5 F| POS 
6 0} CNT 


1 
1 


0 
0 


X X XX KX KX X KX KX KX KX KX KX KK KK KX K KXIKXIXIX;X;O OX KX KX KXiX KX KX KX KX KIO 


XX X KX X X KX KX X KX KX KX KK KX KK KX KX KIXIXLAO;X1O OX X X x10 


010 0/0 
1 


1 
1 


XX X X X X KX X KX KX KX KX KX KX KX XIX KX KX KIXIXIO;XIO O;X X X KIO 


X X X XK KX KX KX K KX KX KXeKX KX KX KX XIX KX WK KIXIXIXIX1O-01X KX K XIX KX KX XK XK KIO 


XX X X X KX KX KX X KX X KX KX KX K KIX KX K KIX XIXIXIO OO; X KX KX XIX KX KX K XK KIO 


SUB 


4 


6 





SHARED CONTROL FIELD 
ENABLE 


DEVICE 


CCU CONTROL FIELD 


Am2903 CONTROL FIELD 
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ao t xoooOox-Ex xoroo0ox°9o xx xx Xx xX «XK «KKOFrOXxKOOr 
335 Ys) xr COOxXKX xr OCOC0OXxXxO xxx *K KKK KK KR OFX Xr rT O 
aS © Xxerrr xx xrrrrr xO xxx «KKK KKK KR RR KK re 
a7O0 |r xO00Oxx x OOCOOxXO x << <x xX XX KX «KK OOOxXKOOD 
= © xrrrer kx Xr er rer er Ke xxx «KK «KKK KK RK Ke 
oa xOo0O000 XK x xoo0con0o00o x oOo x KKK KKK KK KeOOOKK aOFO oO 
° XxCO00x x Xx OO0D00xXO xxx<x xxx «xX xKXOOOxXKOOO 
= xo000x x xooc000xKO x«Kx< «KK «KK KK KOOOXxxKOO0°O 
193738 a71aid| © xrere xXx Keer e re xXxe xxxKxXXKKxKK XE EE KK Eee 
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| NA SNLVLS| + oooo0o0°co ooooo0o°o°o ooooooooo°coeomooo0o°o°c;°o 
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Eig Nn xOCO00Ox x xCCOOxXXxKO xxx x xx KK «KK OOK KK OO x 
- Q xrrr rx &X xrrrrxkxxKO xxx «KK «KKK KK ER KK KR KX 
ALIHW10d| = xr rr OX xr rrrxXxXO xx XK KK KKK KK KKK KX 
[wauilg) ee ee eee ee ee ee ere 
x CO000000 eco000OoCoOO 27OCODODCOCOCCOOCOOOCOCSO 
Fe Pe ocoCCoCOOO oCo00Co000 SCO CDCOOCDOOCCOOOOOO”O 
33 S ecoooc0o0o co0oCCooo COCOCCOCDOOCOOOOOOOOCO0NO 
3 oooo0o°o°o oooo0o 0000 oooo0ococoooooo0oo0o°”coeoo 
8 ox KX OOOO OoxC_KM OOOO Oo xxeMoooo0ooeoccoco0cnceooo0o e000 8} 
oO Ox KM OOO Oo om KMOOCOO0O 0 <xeCeoocoeocoocceceoooo0noo°oeoe 3S 
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. + xx x xX «KK O xx xX KX KOO Xx xXxxKrOOCOOrrOKxKXK KOM Mere 
<a? > xx xxx xO xxx xXx r rx xx rOOCCDOOOOKXxKXxXxr xk xKOO 
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& 2 oxxKxOOxXO -xx< rr KK xx «xx «& «xX «Kr Kr OOrTrOoOOoOOreo°o 
on? © -xxX ee Ke -xx+ xxx xx xx xxx kK OCOOr+r+r oo 
Qo | oxxXxooxo xXx ro xXx xX xx XxX XXX xK Orr OOOrO0000 
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one We -xKxOrr Xr ex ere r er xX -OCOrro0r0d00000rrraono 
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APPENDIX A 


Am2904 CONTROL FIELD 


COMMENTS 


CONSTANT 





r 
Q 
- 
oO 
=> 
eg 
- 
” 
< 


89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74|73 72 71 70/69|68|67 66/65 64|63 62 61 60/59 58 57 56 55 54 


ADDRESS LABEL 


X XIX X¥ X XIX X X XK X X 


1 
1 


0 1;1;0;)1 
0 1 
0 0 0/0 


1 
1 


XX XX XX XXX XXX XX XK XIO 
X XX XX XXX XX KK XK XK XK XII 
1 
X X X KX X KX XX X X KX KK XK XK X 


0; ENTER 


O|X X X KX X X 


0/1 X X/0 1 1 
11/1/;0;0 O;}X X X X}0 000 0 0 
O;1/;1};X X|X X X X{|X KX KX K XK X 


1 


1 
1 


1 0 


1 


00 0 000100 0}1 


1 


0000 


3} LOOP 


CONVERSION 


BCD TO BINARY 
ROUTINE 


Xx X KX X X X 


1 
1 


1; 0; 0 
1 


1 
1 


0 
0 
0 


1 
1 


1 
1 
1 
1 
1 


0000000000000 0 0 0 
0000000000000 0 1 


BRANCH 
TABLE 


X XX KX XK X 


1 


1 


0; 0 


1 


0; 0; 1 
0/0; 1 


100 0 0 

100 1 
0000000 0 
000000 
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APPENDIX B 


AMDOS/29 AMDASM MICRO ASSEMBLER, V1i.1 
CPUII DEFINITIONS 


sADVANCE MICRO DEVICES 
; AMZ2923 AND AM2904 DEFINITION FILE FOR CPUIT 


j 
;REV. OCTOSER 17, 197& 


WORD 92 
;ZQUATES 


oN s EQU RAF 
Se: HOU E#O 
OFF: FQU B#1 


32903 DESTINATION MODIFIERS 


ADE S$ FOU H#@ 
LDR: ZOU H#1l 
ADRQ: FQU E#2 
LDRQ: EQU H#3 
RPT: BQU E#4 
LDQP: KOU F#S5 
QPT: EQU A#E 
RQPT: SQU H#? 
AUR: EQU E#8 
TUR: LOU E#9 
AURQ: EQU H#A 
LURQ: EQU FHB 
YBUS?: EQU FHC 
LUGS EQU H#D 
SINX: EQU E#E 


»CONSTANTS 


Ries EQU BPH? 
Ris EQU H#i 
R2: BQU E#H2 
BOs EQU H#S 
h4s QU H#4 
kos EQU E#5 
ROS EQU E#S 
Rvs EQU H#"? 
Res EOU HHS 
RG: LQU H#9 


k1@: HQU H#A 
(sly EQU H#EB 
Ric? FQU E#c 
His: KQU HAD 
R14: EQU HAE 
Rios EOQU H#F 


AMDOS /29 AMDASM MICRO ASSEMBLER, V1.1 
CPUII DEFINITIONS 





32923 SOURCE MODIFIERS 
RADB: EQU ZB#2@1 
RAQ: EQU SB#010 
DARE: EQU 3B#122 
DADE: EQU SB#1¢1 
DAQ: EQU 3B#112 
31/0 

IOINS BQU 12H#01 
BIN: EQU 12H#1@ 
BOUT: EQU 12H#a8e 
LMAR: EQU 12H#1@ 
YREG: FQU 12H#¥2 
AOQUT: EQU 12H#4@ 
ICUT: EQU 12H#@4 


sCARRY SELECT 


ONzZ: 
GZ: 


EQU 
EQU 


2BHO1 
2RH1A 


;SUB DEFINITIONS 


SUBG: SUB 36X,1BH@,4VX,4VX,4VX 

SUB1: SUB 36X,1B4@,4VX,4VX,4VX ,4VH#F 
SUR2: SUB 36X,1B#0,4VX,4VX,4%,4VH#F 
SUB3: SUB 3VBH@@0,16X,1B40,13xX 

SUB4: SUB 36X,1B42,12X 

SUz5: SUE 44X,15#0,15X 

SUB6: SUB 44X,1B40,15X 

SUB7: SUB 26X 

SUBS: SUB 36X,1B40,4VX,6X,4VH4F 
SUBS: SUB 36X,1B¥0,4VX,4X,4VX,4VH#F 
SUB1@: SU3 36X,1B#0,4VX,4VX, 4X | 
SUR11: SUB 24X,2VE#O, 34% , 4240000 ,1F#1,5X 
SUE12: SUB 77X,1B#1,12VXEHO% 

SUB13: SUB SPF,3VEB#00@,16X ,1B#0,13% 
SUR14: SUB 24X,2VE#Q0,24X ,4E#0000, 22#12 
SUB15: SUB 23X,1B#@,6X 

SUB1G6: SUB SPF,3B#000,16%,1VB#0,13X 
SUB17: SUB 54x 

SUB1&: SUB 22X,1B#0,7X 

SU319: SUB 16%,1B#2,13X 

SUR20: SUE 1X,1VB#2,14X 

SB21: SUB 38X,H#B,20X 


3CCU CONTRCL 
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ACK: DEF 66X,H#9,20X 
OBF: DEF 66X,H#A,<@X 
CNT: DEF 66X,H#F,20X 
GRD: DEF €6X,FH@,2OX 

JZ: DEF SUB11,H#2 ,SUB2O 
CJS: DEF SUB11,E#1,SUR2@ 
JMAP: DEF SUB11,H#2,SUB2@ 
CJP: DEF SUB11,H#3,SUB22 
PUSH: DEF SUB11,H#4,SUP22 
JSRP: DEF SUB11,H#5,SUBZ22 
CJV: DEF $UB11,E#6,SUBe@ 
JRP: DEF SUB11,h#7,SUR22 
RFCT: DEF SUB11,H#@,SUB2¢ 
RPCT: DEF SUB11,H#9,SUB20 
CRIN: DEF SUEB11,H#A,SUF2@ 
CJPF: DEF SUB11,H#B,SUB2@ 
LDCT: DEF SUB11,H#C,SUBE?@ 
LOCPF: DEF SUB11,H#D,SUE2@ 
CONT: DEF SUB11,H#E,SUBRLO 
JP: DEF S$UB11,H#F,SUBES 
JSR: DEF SU514,H#01,SUB20 
RTN?: DEF SUB14,H#OA,SUB2E 


;SATARED CONTROL FIELD 


GOTC: DEF SUB12 
COUNT: DEF SUB12 
PUT’ DEF 77X,1B42 ,12VXHHOZ 


;PCLARITY CONTRCL 


Ts DEF 65X%,1B#1,24X 
ee DEF 65X,1B40,24% 


32923 CONTROL/FUNCTIONS 


IN: DEF SEX,13#1,c4#F OX, H#r ,HH#O,19X,1B#0,135X 
QUT: TEF 26X%,1B40,8X,H#F,HH#C,H#6,SUBS 
YOR: DEF 56X,1B#1,53% 

RIGH: DEF SUBS ,H#2,33#@12,SUB19 

ORS: DeF SUB1,H#1,SUB2 

SSR: DEF SUB1,H#2,5U385 

ADD: DEF SUB1,H#3,SUBS 

PAS? Def SUB2,H#4,SURBS 

SOMS $ DEF SUB2Z,E#5,SUBS 

PAR: DEF SUB9,H#6,SUB3 

COMR: DEF SUBO,H#7,SUPE 

LOW: DEF SUBS ,H#&,3X,SUB19 

CRASS: DEF SUB1,H#9,SUBS 

XNAS?s DEF SUEB1L,H#A,SUBS 

XOR: DEF SUB1,E#8,SUBS 

AND: DEF SUB1,H#C,SUBS 

NOR: DEF SUP1,E#D,SUBE 

NAND: DEF SUB1,H#=,SUBS 

OR: DEF SUB1,BA#F,SUBS 


32S@2 SPECIAL FUNCTIONS 
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UMUL: DEF SUBO,H#2,SUB1i6 

TCM: DEF SUBG,H#¥2,SUB16 

SMTC: DEF SUBIO,H#5,SUB16 

TCMC: DEF SUBG,H#6,SUB1S 

SLNs DEF SUB10,H#8,SUB16 

DLN: DEF SUBG,H#A,SUB16 

TDIV: DEF SUBQ,HHC,SUB16E 

TDC: DEF SUB@,H#E,SUB16 

INC: DEF SUB10,H#4,SUB16 

SDQP: DEF SUB4,H#5,4X,SUB3 

SUQP: DEF SUB4,F#D,4X,SUB3 

LOPT: DEF 36X,1B40,8X,4VX,H#6,HH#6, SUBS 
RMOV: DEF SUB2,H#4,SUB3 

GMOV: DEF 36X,1B42,4VX,8¥,MEM,H#4,3B#010,SUB19 
SDRL: DEF SUBiO@,H#1,H#4,SUE2 

SURL: DEF SUB1@,H#9,H#4,SUB3 


°2004 SHIFT CONTROL 


SDDE: DEF SUBY,H#3,SUBE 
SDUH: DEF SUB?,H#7,SUB5 
SDDL: DEF SUB7,H#6,SUB6 
SDUL: DEF SUB7,H¥6,SUB5 
RDD: DEF SUB7,H#F,SUBE 
RDU: DEF SUB? ,H#F,SUBS 
SSXO: DEF SUB7,H#E,SUBE 
RSD: DEF SUB7,H#A,SUBE 
HSU: DEF SUB7,H#A,SUB5 
SUL: DEF SUB7,H#2,SUB5 
SUd: DEF SUB7,H#3,SUB5 
SDL: DEF SUB7,H#@,SUBE 
SDE: DEF SUBY,H#1,SUB6 
SDMS: DEF SUBY,H#5,SUBE 
SMS: DEF SUB7,H#2,SUBE 
SDDC: DEF SUB7,H#7,SUB6 
SDUC: DEF SUB7,H#4,SUB5 


32904 MICRC INSTRUCTION CODES 
RSTI: DEF 3@X,6BH#2@0011,SUB17 


SWAP: DEF 3 X,6BH#000010 ,SUBI? 
SHLD: EQU 1B8#1 


32904 MACEINE INSTRUCTION CODES 
LMA? DEF SUEBI5,6B#200902,SUB17 


ROTA: DEF SUE15,6B40@9@11,SUP17 
SEOLD: ODFF 23X,1B4#@,66X 


32904 MICRO STATUS SELECT 
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MIZ: DEF $UB18,6B#4#@191900,SUB21 
MIO: DEF SUB1S,6B#010110,SUB21 
MIC: DEF SUB1&,6B#%9110108,SUB21 
MIS: DEF $UB18,65#4@11118,SUB21 
32904 MACHINE STATUS SELECT 

MAZs DEF SUB1S,6B#10019@,SUE21 
MAO: DEF S$UB1&,6B#189110,SUB21 
MAG: DEF SUB1&,6B#10101¢@,SUB21 
MAS: DEF SUB18 ,628#14111¢@,SUB821 
»DEVICE DISABLE 

ALUCHF: DEF 7 X,1B#1,13% 

ALLOFF: DEF 7 X,3B#111,13% 

»LOAD CONSTANT 

CONST: DEF 16 VXH#@%,4X,1B#@,69X 
>BCD STATUS REGISTER CONTROL 

ENR: DEF 16X,1B#@,73X 

CLSR2: DEF 17X,1B#0,72X 


ENSR1: DEF 18X,1B#1,71% 
CZERO: DEF 19X,15#0,70%3 


END 
TCTAL PHASE 1 ERRORS = @ 
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C10 
C1igY¢ 
0121 
C122 
d1go6 


G104 
2165 
Z1IZE 
J107 
Z1CE 
g1ds 


Z10A 
Z125 
212C 
0100 
@124h 
Z10F 
0114 


©111 
@112 
O11e 
2114 
@115 
Z11€ 
g117 
@11€ 


@1i¢ 
@11A 
C118 
Z11C 
G110 
G11 
O11 
Z12¢e 
0121 
C122 
G1z25 
J124 
Yizs 
C1<é 
127 
8128 
Z129 
J12A 
w12kh 
€12C 
G12) 
C12k 
@1e2P 
0130 


;ALVANCE MICnCG TEVICES 
; AMZSZS AND AM2O@4 CPUIT SOURCE FILE 


INP: 


CUTP: 


USM: 


DIV: 


LOOF1: 


SCALE1: 


LOCP2: 
SKIPE: 


SKIPS: 


ORs E#1¢@ 

ALUGFF & T & CBF & CJP & GCTO INF 
ALUGFF & PUSH 

IN & T & OBF & LOOP & PUT ICIN 
ALUCFF & RTN 


CUT & CONT & PUT YREG 

ALUOFF & PUSH 

ALUCFF §& F & ACK & LOOP & PUT IOUT 
ALJUCFF & PUSH 

ALUCFF & T & ACK & LOOP 

ALUOFF & RTN 


LOW R1 & JSR & GOTO INP 

PAK R2,Ri5 & JSR & GOTO INP 

LQPT Ri5 & F & GRD & PUSH & COUNT GE 
UMUL R1,R1,83 & F & CNT & SDDL & RFCT 
PAR K15,R1 & JSR & GOTO OUTP 

QaMOV R15 & JSR & GOTO OUTP 

JP & GCTO USM 


LOW Rk1 & JSK & GOTO INP 

PAR R@,R15 & JSR & GOTO INP 

LQPT R15 & F & GRD & PUSH & COUNT 2@D 
TCM R1,R1,k@ & F & CNT & SDDL & RECT 
TCMC R1,R1,R@ & SDDL & CCNT CZ 

PAR R15,R1 & JSR & GOTO OUTP 

QMOV R15 & JSR & GOTO OUTP 

ALUCFF & JP & GCTO 5” 


LOW R1@ & JSR & GOTO INP 

PAR k7,ki5 & JSR & GOTO INP 

PAR R1,R15 & JSR & GOTO INP 

PAR R4,R15 & CONT 

FAR RS,R? & CONT 

PAR R2,R1 & T & MIZ & CJP & GOTO ABCRT 
SMTC R2,R2 & CONT CZ 

SMTC RS,RS & T & MIO & CJP CZ & GOTO SCALE1 
ALUOFF & T & MIO & CJP & GOTO SKIPE 
SURL RS,R3 & SUL & CONT 

SURL R2,R2 & SUL & CONT 

ALUOFF & JP & GOTO LCOP2 

LQPT R4 & JSR & GOTO SDIVD 

ALUOFF & JP LCOP? 

SSR R15,R3,R2,YBUS & CONT ONE 

LQPT R48 F & MIC & CJP & GOTO SKIPS 
ALUOFF & JSR & GOTO SDIVD 

SDRL R2,Re2 & SDL & CONT 

ALUOFF & JP & GOTO LOOPe 

ALUCFF & F & GRD & LDCT & COUNT 2&C 
DLN R1,R1,R7 & T & GRD & RDU & PUSH 
TDIV R1,R1,R7 & F & CNT & RDU & RECT CZ 
TDC R1,R1,R7 & SUH & CONT CZ 

QMOV R15 & JSR & GOTO OUTP 
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£151 
O18z 
0135 
2134 
@1E5 
J13€ 
0137 
¥1SE 


0139 


C13A 
Z1zB 
Z13C 
13D 
@1Si 
O13F 
2142 
2141 
B14z 
21432 
@144 
W145 


Z14E 
4147 
W148 
C149 
G14A 
Y14F 
2£14C 
14D 
J1i4k 
¢14F 
Z1d¢ 
£151 


W152 
Yd 
6154 
ibe 
C15E 
C157 
W15E 
W15¢ 
w15A 
€15B 
Wid5c 
Z151 
WL 
JiSF 
CLE? 
161 
2162 
YW1ies 
0154 
165 


SDIVD: 


NEG? 


RET: 


SLNOR™ : 


AGAIN: 


EMD* 


PINGRM: 


POS: 
CNTs 


SUB: 
PENDS? 


PAR R15,R1 & JSR & GOTO CUTP 
ALUOFF & JP & GOTC DIV 

PAR R1,R1 & CONT 

ALUOFF & T & MIS & CJP & GOTO NEG 
PAR R1,R1,ADRQ & SDDL & CONT 
ALUOFF & JP & GOTO RET 

PAR R1,K1,ADRQ & SDIL & CONT 

QMOV R4 & CONT 

PAR R10,R10 & RTN ONE 


JSk & GOTO INP 

LQPT R15 & CONT 

SIN R2,R2,CFF & CCNT & SHOLD 

MAZ & T & CJP & GOTO ABORT 

MAC & T & LOW R@ & CJP & GOTO END 

SLN ke,R2 & MAC & T & CJP ONE & GOTC END & SUL. 
SIN R2,k2 & MIO & ¥F & CUP ONF & GOTO AGAIN & SUL 
SPQP & SMS & CONT 

SRS R2,R2, RY & CONT 

QMOV K15 & JSR & sOTC OUTP 

PAR R15,R2 & JSR & GOTC OUTP 

JP & GOTC SLNORM 


JSf# & SOTO INF 

LQPT R15 & JSR & GOTC INP 

DLN R15,R15,R15,CFF & CONT & SHOLD 

MAZ & T & CJP & GOTO ABORT 

LOW R2 & MAC & T & CIP & GOTO ENDZ 

DLN R15,R15,Ri5 & SDUL & MAO & T & CJP & GCTCG JUMP1 
DIN R15,R15,n15 & SDUL & MIO & T & CJF & GOTO JUMP1 
PAR R2,R2 & JP ONE & SOTO LOOP4 

PAR R2e,Re & CCNT ONE 

ODRQ@ R1iS,Ris & SDMS & JSF & wOTO OUTP 

QMOV R15 & JSR & GOTO OUTP 

JP & GOTO DLNORM 


TOW R1@ & CONT 

LOW RZ & JSR & GOTC INP 

PAR R1,n15 & COAT 

PAK ne,R®,,TARB & CONST BOGS & CONT 
FAR 4¢é,R@,,DARB & CONST @8@3 & CONT 
PAR h4,RO,,DARP & CONST F4BEFF & CONT 
PAR n5,ke,, LARS & CONST 4860 §& CONT 
FAR R6,RO,, DARB & CONST O¢28 & COAT 
ORS RZ,R1L,RS & CONT & SHCLD 

AND R5,R5,R4 & CONT 

SDRL H4,k4 & MAS & CJP & GOTO ENDS 
SUnrL kd,k’ & T& MAS & CUP & GOTO POS 
OR RS5,RS & JP & GOTO CAT 

CR RS,R2e & CONT 

SRS RE,RE,R10 & CONT 

SDRL R2e,Re & T& MIZ & CJP , SHLD & GOTO ENDS 
SDRL R3,RS & T & MAS & CJP & GCOTC SUE 
ATL Kk2,RO,RS & JP & GOTO CYCLE & SHOLL 
SRS R@,RE,RS & JP & GOTO CYCLE & SHOLD 
JP & GOTC SQRT 
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G1iée€ ABCKT: ALUOFF & JP & GOTO ABORT 
£167 JP & GOTO DIV 
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G10 
$101 
J18z 
G18 
2124 
B135 
2186 
2187 
0136 
Q1d% 
Giga 
Y1dE 
¢1gc 
12D 
J1Gk 
V1gF 
@1ile 
111 
C1llzé 
C1ile 
0114 
e115 
@1i1€ 
0117 
w1le 
g119 
C11A 
C11KE 


G11C 


XXXXXXXXXXXXXXXX 
1116120011X21100 
XXXXXXXXXXXXXXXX 
IXXXXXG1IGOXS1LXKX 
XXXXXXXXXXAXXXXX 
1110101101X002¢2 
XXXXXXXXXXXXXXXK 
18900010190 X91XXxX 
XXXXXXXXXXXAXXXXX 
LXXXXX1110XOL OCA 
XXXXXXXXXXXXXXXX 
LXXXXXS1IGSXG1XXX 
XXXXXXXXAXAXXXXX 
1018011101X91400 
XXXXXXXXXXXXXXXX 
UXXXXXC1GOXCAXXY 
XXXXXXXXXXXXXXXX 
111@0111@1X@1Xz7%X 


YXXXXXXXXAXXXKXX 


1JO0I01210 X01 XXX 
XXXXXXXXXXAXXXXXX 
LOLOL OVLTIXGZ1LB 
XXXXXXXXXXAXXXXX 
1OLLLLCLG1KSG12B 
XRAAAAXXXXXAXXXK 
10G0d05d0100XC0109 
XXXXXARXXXAXXXXX 
1011111020200XKX 
KXXXXXXXXXAXXXKXX 
1POCCe2LV1KAG2102 
EXXRXERXXXKAXXXX 
1dd020SDI01X20108 
AXAXXXAXXXNAXXXXE 
LX XXXXLLIVXS X11 
AXXXXXXXXXXXXXXX 
1SELOLLLGO1XLL1LL 
EXXXXAXXKXAXXKXX 
1208062001 KD0162 
KXXXXAXXXXAXXKXX 
LZLOLOC1ILOGOXCU1LS 
XXXXXXXXXXAXXXXX 
LZILLLIS ESBS SAKX 
XXXXXAAXXAKAKAXK 
LXXXXX1L1LM SAD XXX 
AAXXXAXYXXXXXKKX 
LYOCOOGOLIKLC1G2 
KXXXXXXXAXKAKXXX 
1Z22OKPCCLKLG1G2 
AAKKKRAXXXXAY XXX 
LXXXXX1111X01190 
KXXAXAXRXXAXKAXXXXX 
I1ZELWVSPLL1IXSZ1 00 
KXXXXXXXXXAXKXXX 
1ZLLCLVLGIXKCO1GL 
AXXAXAXAAXAXKAEXXE 
1036090001 X90152 
KXXXXXXXXXKXALXX 
LXXXXXLLIG AC OXXX 


XXXXXXXXOGOKXXXKXX 
Z1IAIOILOAY 
XXXXXX¥XOOKXXXXXX 
XAXXXXXXXX 
XXXXXXXXOOXXXXXX 
CQOBLOOGS1 
XXXXXXXXOOXXXXXX 
XXXXXXXXXX 
XXXXXXXXOOXXXXKXX 
COYOLSOCL1Y 
XXXXXXXXOOXAXXXKX 
XXXXXXXXXX 
KXXXXXXYOCXKXXXX 
2000002128 
XXXXXXXX¥@OKXXXXX 
XAXXXXXXXX 
XXXXXXXXOOXXXXXX 
XXXXXXAXXXX 
XXXXXXXX@OXXEXXXX 
XXXXXXXXXX 
XXXXXXXYSOXXKXXK 
C1ILLEL BLS wv 
XXXXXXXXZOXARKIA 
GISOOLESSL 
XXXXXXAXOCK XXXXX 
0000981112 
XXXXXXXXOOC11E XX 
XXXXXXXXXX 
XXXXXXXXSOXXXXXX 
CILDOAPL1ES 
XXXXXXXKSSXXXXXX 
01002061900 
XXXXXXXXOGOXXAXXX 
12CC21218 
XXXXXXXXODIOXXXXAX 
Z1ILLLOL OCH 
XXXXXXYXEOSKXXXXX 
J120ADOVIZ 
XXXXXXXXOCXXXKXX 
2OCCCC11¢1 
XXXXXXXXJIJO112AAK 
XXAXXXXXXX 
XXXXXXXX1OC118XX 
XXXXXXXXXX 
XXXXXXXXOSOCKKXXAX 
SGI VSLOC1 GS 
XXXXXXXXDOXXXXXX 
0120022102 
XXXX¥XXASPKXXKYX 
2190210@01 
XXXXXXXXOOXXXAXKX 
Z1ILPOVALSG 
XXXXXXXXAOXAXXAKX 
C1IPAPSSLBC 
XXXXXX¥XCOXAXAKK 
J1ZLGBI09S 
XXXXXXXXSLXXXXXX 
XXAXXXXF XK 


XXXXXXAXXXXXXXXX 
EXXXXXXXXXXEXXXX 
XXXX11111XX4XXXX 
XXXXXXXXXXXXXKXX 
XXXX@XXXXXXXX111 
XXXXXAXXXXXXAKXXX 
XXXXXXXXXX AAXXXX 
XXXXXXXXXXXXXXXX 
XXXXXXXXXXAAXKXX 
XXXXXXXXXX XAXXXX 
XXXXO@@OS1LX XXXXXX 
XXAXDIOOGOXAXK111 
XXXXZXXXXXAXXK1I11 
XXXXZ2Z201GD001C 00 
XXXXO1111XX2KV928 
XXXXDLILIXXXXXXX 
XXXXXXXXXXAKXXXE 
KAXXDCCOLAXXAXK XK 
XYXXSCLEEXXXX111 
XXXX@XXXXXXAXX111 
YALX ISGC1IBIL1e SS 
XXXXS@UCL1LZUC1IGLS 
XXXXSLLLAX¥ XX SL 
KEXXOLILIXXXXXXX 
XXXXXXXXYKXAKAXXKX 
XXXXSICIS|XAKXAXKK 
EXKRXOO111XXXFX111 
XXXXOSKLLLXXAX111 


XXXXOZIZEXAKXA111 


XXXXXXXXXXKXSGVO 
XXXXXXXEXXX XVID 
X¥11112000xxXx20a0 
XXX XXXXXXXXXBOOG 
111@0011990e0090 
XXXXXXXXXXKXZOVO 
XXXXXYXXXXXXACOS 
XXXXXXXXXXXXGLGE 
XXXXXXXLXXXXGVIL 
XXAXXXXXXXXX CGO 
X111110COXXX EGG 
11111211¢9200z02 
1011¢211¢2902¢0 
2G02982020ReC020 
11111411292008000 
¥1111010001e¢000 
XXX XXXAXXAXXZWV*G 
¥11111000xxx2902 
11111011¢200200¢ 
12110211920e¢000 
9B91UACBLO209020 
9011222009222 000 
111110112¢ecaco2 
¥111121200180202 
XXXXXXFY XXX KAW 
X11111220XXX2ee0 
1111191192000020 
11111911¢¢02¢20¢ 
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w11t 
@ii¥ 


211F 


G154 
6135 
@136 
187 
138 
0159 


XXXXXXXXXXXXXXXX 
LXXXXFLLIAXS? AXX 
KRAXXARXAXXYARXXX 
1112110011X24102 
XXXXAXAXAKAXXKAXX 
LXAXAAILILS XACXXY 


i XXXXXXXXXXAKXXXX 


1112112011X22102 
KRXXXXAXXXXAKAXKX 
1110112011X¢112@ 
XXXXARAXXXXXXXXXXX 
LAAXXXL11IG2O 0 KKK 
XXXXXXXXXXAXXXXX 
LXXXXX1112GS2X XX 
XXXXXAXAXXXXAXXXXX 
VXXXXX1111Xd119G0 
AXAXXAXXXXXXAXKXX 
1OOLCSLOCLIXGC19B 
KXXXXXXXXXAXXXXX 
TXXXXX1111XO1XXX 
XXXAXAXXAXAXAXXAAXX 
LXXXXX1119XIOXXX 
XXXXXXXXXXXXXXXX 
1212112011X80100 
XXXXXXXXXXAXXXXX 
12O6COCO60G1X0G1102 
XXXXXXAXXXXAAXALY 
LXXXXX1LLIGDOSXXX 
XXXAXXXXXXXXXXKXX 
LXXXXX1111 X11 
XXXXXXXXXXAKXXXX 
1OGLOC11GOXCZ112¢ 


> XXXXXXXXXNEXAXXXX 


1120000100300 KXX 
XXXXXXXXXXXXXKXXX 
1@11111 220 400XXKk 
XXXAXXXXXKRXXXXX 
LXXXXXLTLIGOOOKXX 


» XXXXXAXXXAXXXXXAK 


16G0009C01K00100 
XXXXXXXXXXAKKXXX 
1OOOUVLGOLIXGC102 
XXXXXXXXXXXXXXXK 
LXXXXX1111X01122 
AXXXXAXXXXXXXXXX 
LXXXXX1119X0OXXX 
XXXXXAXKXAXXXXXXX 
11101190011X@1120 
XXXXXXXXXXXXXXXX 
TXXXXX11TIGVST XXX 
EXXXXXFXARAXXXXX 
1XXXXX1111XG1108 
XRXXXAXXAXAXXXXX 


TXXXXX111A0COXXX- 


XXXXXXXXXXXXXXXX 
LXXXXX111OXOEXXX 
XXXXXXXXXXAXXAXX 
1300061010 XO9GXXX 


XXXXXXXXDAXXXXXX 
KRXXXXXXXXA 
XXX¥XAXCKCLKAXKAL 
181190119 
XXXAXXXX1EX XXXXX 
XXXXXXXAXX 
XXXXXX2X1SXXXXS1 
212231201901 
XXXXAXOX OAR KXXCT1 
012010100 
KXXXXXXXPOECO1SPXX 
KXXERNXXXX 
XXXXXXXXJOVBJ1IXX 
XXXXXAXKXXX 
XXXXXXXXDEXXKXKX 
4160140111 
XXXXXXXX@OXXXXXX 
0120112013 
XXXXXXXXO1XXAAXX 
XXXXXAXX XXX 
XXXXXXXXOLXXXXXX 
XXXXXXXXXX 
XXXXXXOXCOXXXXL1 
G12€1811282 
XXXXXXXXOOKXXXXXX 
12110011 
XXXXXXYXOCGCLOSY X 
XXXXXXXXXX 
XXXXXXXXO@PAXXXXX 
@1221@¢111 
XXXXXXXXOOXXXXXX 
Q22OGL21100 
XXXXXXXXOC1111 XX 
XXXXXXXXXX 
XX¥XXXX¥X101111Xx 
XXXXXAXAXXKX 
XXXXXXXX130011XxX 
XXAXXXXAXX 
XXXXXXXXOOXXAXXX 
2182B001290 
XXXXXXXXOOXXXXXX 
CIGCCL1ICD 
XXXXXXXXOOXAXXKX 
100011001 
XXXXXX¥XOOXAXXXX 
XXXXXXXXXX 
XXXXXXOXSZAXXXE1 
@10¢11¢111 
XXXXXXXXZOO11GXX 
XXXXXXXXXX 
XXXXXXXXOCXXXKXX 
6100111062 
XXXXXXXXOO@119XxX 
XXXXXX KXXX 
XXXXXXXXOOXXXXXX 
XXXXXXFXKX 
XXXXXXXXOIXXXXXX 
XXXXXXXXXX 


XXXXOOO1IXXXXZ1i1 
PAISSLCCACK AXXC VE 
XAKXSPSALCS CLK XX 
110060110011 X42% 
SPLAUSKXXXXXAXRKXKXX 
XXXXOCZLIMSLLK XY 
XXXXOO@DIGLZO1LGLXX 
XXXX¥XXXXXXAXXXX 
XXXXO@XXXXXXXXC10 
XXXXXXXXXXXAXXXX 
X¥XXXG611112@11001 
LZAISSKXXXXXAXXL1 
XXXAXXAXAXAXKAXXXA 
XXXXAOG1LSLOISK XX 
XXXXXAXXXXXXXXXX 
XXXXXXAXXXXXXXXX 
XXXXO¢LG1GOC1211 
XXXX00@210@01011 
XXXX000010001011 
XXXX¥O1111X XAAXXAXX 
XXXXO1111XXXXOL2 
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Introduction 


In order to access instructions and data in an orderly manner 
within a computer, a Program Control Unit is usually used to 
provide the most efficient mechanism for program control. A 
program is a set of instructions which direct the processor to 
perform a specific task. Ordinarily, program instructions are 
stored in sequential memory locations. During the normal pro- 
cessing of a program, an instruction is fetched from the location 
specified by the program counter, the instruction is executed, the 
program counter is incremented, and another fetch and execute 
cycle begins. The addressing mechanisms that such control unit 
might employ are various. Indeed there are some machines that 
literally use dozens of addressing modes to fetch instructions and 
data. In this discussion of program control units, several of the 
addressing modes and their common implementation techniques 
will be discussed. The addressing modes used commonly in 
today’s machines include register, immediate, direct, indirect, 
index, and relative and various combinations thereof. 


Data Formats 


Technically, an instruction set manipulates data of various length 
words. Generally speaking, most 16 bit minicomputers can ma- 
nipulate data of three different word lengths: 8-bit bytes, 16-bit 
words and 32-bit double words. This data may represent fixed 
point numbers, floating point numbers, or logical data. The data is 
used as operands for the instructions, and is manipulated as 
indicated by the particular instruction being executed. 


Typically, fixed point data is treated as signed 15-bit integers in 
the 16-bit representation or as signed 31-bit integers in the 32-bit 
double length notation. Positive and negative numbers are rep- 
resented in the ordinary 2’s complement notation with the sign bit 
carrying negative weight. Positive numbers have a sign bit of zero 
and negative numbers have a sign of one. Tne numerical value of 
zero Is always represented with all bits LOW. 


Floating point numbers consist of a signed exponent and a signed 
fraction. Many different formats are used by manufacturers in 
expressing floating point data and these variations will not be 
described here. Let it simply suffice to say that the floating point 
number represents a quantity expressed as the product of a 
fraction times the number 2 raised to the power of the exponent. 
In some cases, the number 16 is raised to the power of the 
exponent. Typically, all floating point numbers are assumed to be 
normalized prior to their use as operands. No pre-normalization is 
performed and all results are post-normalized. Usually, the float- 
ing point instruction set will normalize un-normalized floating 
point numbers. 


Logical operations are used to manipulate 8-bit bytes, 16-bit 
words or 32-bit double words. All bits participate in the logical 
operations. 


Instruction Formats 


Various minicomputers use different types of instruction formats 
ranging from the very simple straight forward formats to the more 
complicated difficult to decode formats. For example, a register to 
register format can consist of a simple 8-bit opcode and two 4-bit 
source operand specifiers. On the other hand, it may consist of a 
byte or word specifier, an opcode specifier, source and destina- 
tion register specifiers, and mode specifiers for each of the source 
and destination register selections. Again, it is not the purpose of 
this application note to describe all of the trade-offs in selecting 
instruction formats but rather to select a simple format such that 
the student of bipolar microprogrammed microprocessors can 
understand the techniques used by instructions for operating the 
machine. 


Thus, we will use a few 16-bit and 32-bit formats in this application 
note to demonstrate the function of the program control unit in 
various types of instruction execution. 


Instruction Types 


For purposes of this application note, we will define nine different 
instruction types using various addressing modes. As we define 
these instruction types, we will use the basic ADD instruction as 
the example in all cases. It should be recognized that the opera- 
tions of the instructions are similar for all the arithmetic as well as 
logical type operations. However, by using the ADD instruction it 
will be easier to describe the operation of each of these instruc- 
tions rather than to try to be very general in their description. 
Figure 1 shows all nine instruction types with their appropriate 
names. As is seen, four of the instruction types are single 16-bit 
word instructions while five of the instruction types are double 
word or 32-bit, instructions. The advantage of the double word 
instructions is that a second word can be used as an address 
whereby it provides an index value or a second word can be used 
for data which is used as an immediate value. 


Register-to-Register Instructions 


When the register-to-register (RR) instruction is executed, It is 
simply a technique for selecting two of the machine’s internal 
working registers in order to execute the desired operation. The 
instruction is fetched from memory and placed in the instruction 
register and the source register R2 and second source register 
R1 are selected as the two source operands for the ALU. Register 
R1 is the destination register in addition to being a source register 
and the results of the ALU operation will be placed in the register 
specified by the R1 field. In the instruction format shown in Figure 
1 for the register-to-register instruction, the 8-bit opcode field 
specifies the machine operation to be performed. The next 4-bit 
field, R1, in the instruction format specifies the address of the first 
operand. In most machines, the R1 field is normally the address 
of a general register. The 4-bit R2 field in the register-to-register 
instruction format specifies the address of the second operand; 
this also is normally the address of a general register. In most 
machines, the R1 field also in addition to being a source operand 
is the destination general register select. Thus, the results of the 
operation are stored in the register selected by the R1 field. 


The RR instructions are used for operations between registers. 
We are assuming in this discussion that the machine contains 16 
general registers which function as accumulators or index regis- 
ters in all arithmetic and logical operations. Each general register 
contains a 16-bit word consisting of two 8-bit bytes. For arithmetic 
operations, the most significant bit is considered the sign bit using 
2’s complement representation. The general registers of the 
machine are usually numbered from 0 to 15 (decimal) and written 
in hexadecimal notation as 0 through F. In this example, the 
general registers have not been given specific functional assign- 
ments. However, in some machines certain registers are as- 
sumed to perform specific functions. These can include specific 
stack pointer registers and program counter registers. Figure 2 
depicts the typical signal path for executing the RR instruction ina 
bit-slice system. 


The actual operation of the Register-to-Register Instruction is as 
follows. First, the instruction is fetched and placed in the instruc- 
tion register as shown in Figure 2. This is part of the fetch routine. 
Next, the instruction is decoded via the mapping PROM and the 
appropriate microinstruction in the microprogram memory 
selected and placed in the pipeline register. Then, the instruction 
is executed where the two registers in the general purpose regis- 
ters of the Am2903 are selected by the contents of the R1 and R2 
fields of the instruction register. The actual microcode required to 


191 


192 


Register-to-Register 
0 778 11412 15 


ADD INSTRUCTION 


(R1) < (R1) + (R2) 


Register-to-Memory Reference 


(R1) <— (R1) + [(X2)] 


Memory-to-Memory 


[(x1)] <— [(X1)] + [(x2)] 


E 
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oO 
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Register Short Immediate 


(R1) < (R1) + DATA 


Register-to-Indexed Memory 

ADDRESS (R1) <— (R1) + [(X2) + A] 
(R1) —(R1) + DATA + [(X2)] 
ADDRESS [(x1)] <— [(X1)] + [(X2) + A] 


(R1) <— (R1) + DATA 


DATA 


Note’ (R1) means the contents of register 1. 
[(X1)] means the contents of the word whose address s in R1. 


[(X1)] <— [(X1)] + DATA 





Figure 1. Various Instruction Types for the ADD operation. 
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Figure 2. Register-to-Register Instructions Select Two Registers in the Am2903 Array for Instruction Execution. 








execute this instruction is shown in Figure 3. Here, we assume 
the Program Counter (PC) value is contained in one of the gen- 
eral registers and can be selected by microcode as well as the R1 
and R2 fields. This was shown in Chapter 3. 


Register-to-Memory-Reference 


The register-to-memory-reference instruction is one whereby the 
contents of the memory location pointed to by the register iden- 
tified with the X2 value is fetched from memory and then added to 
the register value specified in the R1 field. The result of this 
Operation is placed in the register specified by the R1 field. 


Figure 4 shows a general block diagram of the hardware used to 
implement the instruction types described in the first part of this 
application note. As shown, the memory address register can be 
driven by either the Y outputs or the DB outputs of the Am2903s. 








Microinstruction 
Operation 
PC — MAR; PC + 17 PC 
Fetch Inst to IR 
Decode 
Ri+R2 —>—Ri 


TO | T1 | T2 


X 
X 
X 








T3 


In addition, the Y outputs of the Am2903s can be placed onto the 
memory data bus by means of a three-state buffer. The computer 
control unit is intended to be representative of that described in 
Chapter 2 of this application note series. For purposes of this 
discussion, we assume the program counter (PC) is one of the 
general purpose registers within the Am2903 register stack. 
Later, we will change this concept and use the PC external to 
Am2903. 


The operation of the register-to-memory-reference instruction as 
depicted in Figure 1 can best be described by referring to Figure 
5. Here, we see the first three microinstructions that represent the 
fetch routine for the currently described machine. First, the pro- 
gram counter is placed in the memory address register and the 
program counter is incremented and returned to the PC register. 
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Figure 3. Register-to-Register Instruction Microcode. 
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Simple Memory Addressing Scheme with PC in the ALU. 
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Microinstruction 
Operation 


PC — MAR; PC + 1 > PC 
Fetch Inst to IR 

Decode 

(X2) > MAR 

MEM + R1 > R2 


TO | T1 


Next, the instruction is fetched from memory and placed in the 
instruction register within the CCU. Thirdly, the instruction is 
decoded via the mapping PROM and the appropriate micro- 
instruction selected and placed in the pipeline register. To exe- 
cute this particular register-to-memory-reference instruction, it is 
necessary to place the contents of the register specified by the X2 
field into the memory address register. Then the contents of 
memory can be fetched and the operand added to the value 
currently contained in the register specified by the R1 field. The 
result of this operation is placed in the register specified by the R1 
field. All totaled, the execution of this register to memory refer- 
ence instruction requires five microcycles as depicted in this 
example. 


Memory to Memory 


This instruction is one whereby the memory location pointed to by 
the contents of the register specified in the X2 field is fetched and 
the memory location pointed to by the contents of the register 
locations specified in the X1 is fetched and these two operands 
are added together. At the completion of the instruction, the 
resultant is placed in the memory location as defined by the 
contents of the register specified in the X1 field. 


The Memory to Memory Instruction operation is also depicted by 
the block diagram shown in Figure 4. In fact, all of the next six 
instructions to be defined utilize the block diagram of Figure 4 to 
represent the hardware required for implementing these instruc- 
tions. 


The microcode required for the memory to memory instruction is 
detailed in Figure 6. The first three microinstructions represent 
the fetch routine. In the fourth microinstruction, the contents of the 
register specified by the X2 field are placed in the memory ad- 
dress register. Then, in the fifth microinstruction the contents of 


Microinstruction 
Operation 
PC — MAR; PC + 1 > PC 
Fetch Inst to IR 
Decode 
(X2) > MAR 
MEM > Q 
(X1) > MAR 
MEM +Q->Q 
Q —-MEM 


TO | T1 
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Figure 5. Register to Memory Reference Instruction Microcode. 


this memory location is loaded into the Q register within the 
Am2903. This value is temporarily held for use later. In the sixth 
microinstruction, the contents of the register specified by the X1 
field in the instruction is placed in the memory address register. 
On the seventh microinstruction, this operand is fetched from 
memory and added to the contents of the Q register with the result 
being placed in the Q register. In the eighth microinstruction, the 
current contents of the Q register is returned to the memory 
location. This memory location is specified by the contents of the 
register specified by the X1 field and is still in the memory address 
register. Thus, we have used the Q register as a temporary 
holding register for the data used in this instruction. 


Register with Short-immediate 


This instruction is a technique whereby a 4-bit field is added to the 
contents of the register specified by the R1 field. Thus, short 
jumps or branches can be executed within a range of zero to 
fifteen memory locations. The more significant 12-bits of the word 
are zero filled. 


The register with short immediate instruction operates very simi- 
lar to the register-to-register instruction. The microcode for this 
instruction is shown in Figure 7. The only difference between the 
register-to-register instruction and the register short-immediate 
instruction is that instead of adding operands specified by the R1 
and R2 fields, we take a data value contained in a four-bit field in 
the instruction as depicted in Figure 1 and add it to the contents of 
the register specified in the R1 field. The results of the operation 
are returned to the register specified by the R1 field. This addition 
is performed by taking the 4-bit data value shown in Figure 1 as 
the DATA and zero filling the twelve most significant bits. This 
gives us a 16-bit word ranging in value between zero and fifteen. 
Thus, short jumps can be implemented using this technique. 
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Figure 6. Memory to Memory Instruction Microcode. 
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PC — MAR; PC + 1 PC 
Fetch Inst to IR 

Decode 

R1 + Data >Ri1 
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Figure 7. Register Short Immediate Instruction Microcode. 


Register to Indexed Memory 


The 16-bit word in the register defined by X2 in the instruction is 
added to the address that is the second word of memory. Then, 
this address is used to fetch an operand from memory which is 
added to the contents of the register pointed to by R1. The results 
of this operation are then placed in R1. The instruction format for 
this instruction was shown in Figure 1. 


The Register to Indexed Memory Instruction is shown is Figure 8 
and executed in the following manner. First, the current PC value 
is placed in the MAR and PC + 1 is returned to the PC register. 
Next, the instruction at this memory location is fetched and placed 
in the instruction register. On the third cycle this instruction is 
decoded and the contents of the microprogram memory placed in 
the pipeline register. On the fourth microinstruction, the PC value 
is again placed in the MAR and PC + 1 is returned to the PC 
register. On the fifth microinstruction, the value at this location in 
memory is fetched and added to the contents of the X2 register 


with the result being placed in the MAR. And on the sixth mic- 
roinstruction, the operand pointed to by this address is fetched 
and added to the contents of R1 with the result being placed in the 
register pointed to by the R1 field of the instruction. 


Register to Memory Immediate 


In the register to memory immediate instruction, the contents of 
the memory location pointed to by the register specified in the X2 
field is fetched from the memory and the data value which is in the 
second word of the instruction is also fetched from memory and 
added to it. This result is then added to the contents of the R1 
register and the final result replaces the value currently in R1. 


The register to memory immediate instruction as shown in Figure 
1 is implemented using the microcode shown in Figure 9. Again, 
the first three microinstructions are the fetch routine. The fourth 
microinstruction is used to take the contents of the register 
specified by the X2 field and place it in the memory address 







Microinstruction 

Operation 
PC — MAR; PC + 1 >PC 
Fetch Inst to IR 
Decode 
PC — MAR; PC + 1 > PC 
MEM + X2 —> MAR 
MEM + R1 >Ri1 


TO ; T1 
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Figure 8. Register to Indexed Memory Instruction Microcode. 
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PC MAR; PC + 1 7PC 
Fetch Inst to IR 

Decode 

(X2) ~ MAR 

MEM + R1 > RI1 

PC MAR; PC + 1 —>PC 
MEM + R1—> R1 


TO | T1 
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Figure 9. Register to Memory Immediate Instruction Microcode. 
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register. Next, the operand at this memory location is brought into 
the Am2903’s and added to the contents of the register specified 
by the R11 field with the results returned to that register. The sixth 
microinstruction is used to set up the memory address register to 
fetch the second word of the instruction. The seventh micro- 
instruction brings this data value into the Am2903 ALU via the 
data bus and adds this value to the contents of the register 
specified by the R1 field. The result of the operation is placed into 
the register specified by the R11 field. 


Memory to Memory Indexed 


The memory to memory indexed instruction is one whereby the 
contents of the register specified in the X2 field are added to the 
second word of the instruction to form a new address. This 
address is then used to fetch an operand which is added to the 
operand selected by taking the contents of the register specified 
in the R1 field and using that as a memory address to fetch an 
operand. The result of this addition is then replaced in the mem- 
ory location pointed to by the contents of the register specified in 
the X1 field. 


The memory to memory indexed instruction is probably the most 
complicated of the instruction formats described in the application 
note. In all, nine microinstructions are required for its implemen- 
tation. Basically, the first three microinstructions are used to fetch 
the instruction from memory, place it in the instruction register, 
and decode the instruction for initial operation. Again, the basic 
fetch routine. Microinstruction number 4 sets up the memory 
address register to fetch the second word of the instruction and 
microinstruction number 5 is used to bring this value from mem- 


Microinstruction 
Operation 


TO | T1 


PC — MAR; PC + 1 > PC 
Fetch Inst to IR 

Decode 

PC — MAR; PC + 1—-PC 
MEM + X2 > MAR 

MEM ~>Q 

(X1) > MAR 
MEM+Q>Q 

Q —> MEM 
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ory into the Am2903 ALU where itis added to the X2 register. The 
results of the addition are placed into the memory address regis- 
ter during this microinstruction. This value is used to fetch a value 
from memory which is placed in the Q register using micro- 
instruction number 6. In the seventh microinstruction, the con- 
tents of the register pointed to by the X1 field are placed in the 
memory address register so that microinstruction eight can be 
utilized to bring this memory value into the Am2903s where it is 
added to the contents of the Q register with the result being 
placed into the Q register. Microinstruction number 9 is used to 
place this value back into the rhemory location as specified by the 
contents of the register pointed to by the X1 field. This memory 
address is still contained in the memory address register so that 
no updating is required. The total microcode required to imple- 
ment this instruction routine is shown in Figure 10. 


Register Immediate 


The register immediate instruction is a very useful instruction 
which allows data to be added to the contents of the register. In 
this example, the second word of the instruction is fetched and 
added to the contents of the register specified in the R1 field. 


Figure 11 depicts the microcode used to implement the register 
immediate instruction. Here, the first three microinstructions are 
the fetch routine for the instruction. The fourth microinstruction of 
this routine sets up the MAR to fetch the second word of the two 
word instruction. The contents of this memory location is brought 
into the Am2903 ALU and added to the contents of the register 
specified by the R1 field. The result of this operation is placed in 
the register specified by the R1 field. 
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Figure 10. Memory to Memory Indexed Instruction Microcode. 
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Figure 11. Register Immediate Instruction Microcode. 


Memory Immediate 


The memory immediate instruction is used to add immediate data 
contained in the second word of the instruction to a location in 
memory. The memory location is contained in the register 
specified in the X1 field of the instruction. 


The memory immediate instruction is similar to the register im- 
mediate instruction except that an indirect addressing scheme is 
used. Again, the first three microinstructions fetch and decode the 
memory immediate instruction. The fourth and fifth microinstruc- 
tions are used to fetch the data value which is the second word of 
this memory immediate instruction. Microinstruction number 4 
sets up the memory address register and microinstruction 
number 5 brings the data into the Am2903 Q register. Micro- 
instruction number 6 places the contents of the register specified 
by the X1 field into the memory address register so that the 
contents of this memory location can be brought into the Am2903 
during microinstruction number 7. Here, during microinstruction 7 
the contents of the Q register are added to this value and returned 
to the Q register. At microinstruction 8, the Q register is written 
back to the memory location as specified by the contents of the 
register pointed to by the X1 field. This value was already in the 
memory address register because it was used to fetch the 
operand originally at this location. The microcode for this instruc- 
tion is detailed in Figure 12. 


Microinstruction 
Operation 


TO | T1 


PC — MAR; PC + 1 —> PC 
Fetch Inst to IR 

Decode 

PC — MAR; PC + 1 — PC 
MEM ~Q 

(X1) ~ MAR 

MEM +Q-Q 

Q — MEM 


T2 | T3 


X 
X 
X 
X 
X 
X 
X 
X 


Improving Program Control Unit Performance 


If we examine the microcode as shown for the various instruction 
types depicted in Figure 1, we find that all of these microroutines 
have several things in common. First, the very first microinstruc- 
tion simply sets up the memory address register with the current 
value of the program counter. In addition, this microinstruction 
increments the current program counter value. The second mi- 
croinstruction simply fetches the contents of memory and places 
it in the instruction register. The third microinstruction is used to 
decode the microinstruction, select the appropriate micromemory 
word and set it into the pipeline register. Finally, the fourth micro- 
instruction begins actual execution of the desired instruction. In 
all of these examples and using the block diagram of Figure 4, we 
find that a bottle neck occurs in the ALU because of our need to be 
operating on program counter data and operand data intermixed. 
We can improve the performance of the program control unit by 
making the program counter an external register and using a 
multiplexer to select either the program counter or the Am2903 
output to load the memory address register. This is depicted in 
block diagram form in Figure 13. 


The first effect of implementing a program control unit with this 
architecture is that one of the instruction types is shortened by 
one microcycle. This is the register-to-memory-immediate in- 
struction. The new microcode flowcharts for this instruction is 
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Figure 12. Memory Immediate Instruction Microcode. 
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Figure 13. Memory Addressing Scheme with PC Outside of the ALU. 
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shown in Figure 14. In this case, we see that a PC value can be 


placed into the memory address register and the PC incremented 
while the ALU within the Am2903 is being used to perform either a 
pass or an addition. Thus, this architectural change has made 
some improvement in the thru-put of our machine. 


The most important improvement in thru-put realized by the ar- 
chitecture shown in Figure 13 can be seen by evaluating the 
timing for sequential instructions. That is, what happens when 
several instructions are executed sequentially? 


To keep the examples simple, let’s visualize the microcycle timing 
chart for three register-to-register instructions executed sequen- 
tially. The most obvious timing chart would simply be to take the 
register-to-register microinstruction flows as shown in Figure 3 
and concatenate three examples of this timing chart. If we do this, 
we will see that the final execution of the values of Ri + R2 return 
to R1 utilize the ALU, but the program counter is not in operation. 
However, the next microcycle requires placing the program 
counter into the memory address register. Thus, the architecture 
of Figure 13 allows us to do these two micro-operations during the 
same microinstruction. If we assume three register-to-register 
instructions in sequence in memory; let’s call them instruction A, 
B and C; the timing chart of Figure 15 results. What we see in this 
diagram is that the execution of instruction A can be overlapped 
with the set up the program counter in memory address register 
for fetching instruction B. Thus, instead of instruction B starting at 
time T4, it may be started at time T3. This can be accomplished by 
simply having the execution microinstruction also load the MAR 
with the current PC value and increment the PC. From this dis- 
cussion, we can see that instead of twelve microcycle times being 
required to execute three register-to-register instructions, only 
nine microcycle times will be required. We should caution that if 
the reader counts the microcycles in Figure 15, he will arrive at 10 
microcycle times being required. This leads us to our next point. 


If we examine all of the instructions described earlier in this 
application note, we will find that in all cases, the execution of the 
instruction (the last microcycle) can be overlapped with the first 
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(X2) > MAR 

MEM + R1—>R1 

PC > MAR; PC + 17 PC 
MEM + R1—>R1 
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microinstruction of the fetch routine. Thus, the architectural 
change shown in Figure 13 not only allows three of the instruc- 
tions to execute faster during their total microcode, but in fact all 
microinstructions can be executed at least one microcycle faster 
because of the ability to overlap the first microcycle of the fetch 
routine with the execution of the instruction. This architectural 
change therefore saves one or two microcycles depending on the 
instruction. 


In Chapter 9 we will show how further overlapping at the machine 
instruction level can allow us to execute a register-to-register 
instruction during every microcycle, effectively; rather than every 
three microcycles as shown in Figure 15. At the present time, let 
us simply leave the discussion at this point. 


Subroutining 


An implementation technique that is common to the different 
addressing modes is the subroutine (also called stack and link). 
The subroutine allows sections of main program to access a 
common subsection of the program. The general effect is to allow 
less lines of machine code to be written for any given program that 
employs subroutines. 


Figure 16 shows an example of a subroutine within the program. 
The main program executes instructions until it gets to instruction 
52 which is acall to subroutine. This instruction puts address 80 in 
the program counter while saving address 53 in a separate reg- 
ister called Return Register. The program continues on from 
address 80 to address 85 where it encounters the return from 
subroutine command. The return-from-subroutine command 
takes a value out of the return register and puts that into the 
program counter. At that point the program counter continues 
down in the main body of the program until it reaches address 57. 
At this time, another call to subroutine may occur forcing the 
program counter back to the value of 80 while putting the value 58 
into the return address. The subroutine is executed and at ad- 
dress 85 the return command is again encountered. At this point, 
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Figure 14. Register to Memory Immediate Instruction Improved Microcode. 
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Figure 15. Register to Register Instruction with Overlap of Execute and PC Control. 
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Figure 16. Subroutine Execution. 


the subroutine will return control of the program to address 58 of 
the instruction stream and the main program continues to se- 
quence through its instructions. 


In many systems, one subroutine may very well call another 
subroutine which may in turn call yet another subroutine and so 
on. To accomplish this the return address linkage must now be 
“nested” using a last-in first-out (LIFO) stacking arrangement. 
Figure 17 illustrates subroutine nesting. In this example, the main 
program contains a subroutine call or jump-to-subroutine com- 
mand (JSB) at address 53. Program control is passed to the first 
subroutine at address 88, while the return address 54 is placed In 
the stack. At address 89 the of the subroutine 1 another JSB 
command is encountered passing the program control to Sub- 
routine 2 at address 502. The return address value 90 is pushed 
onto the top of the stack. This continues in like fashion for calls to 
Subroutine 3 and 4 with return address 506 and 723 being placed 
on the stack. At address 785 of Subroutine 4, a Return from 
Subroutine (RTS) command ts decoded causing the return ad- 
dress 723 on the top of the stack to be placed in the program 
counter and the contents of the stack are “poped” up one place. 


MAIN 





At address 725 another RTS command ts found, causing the top 
of the stack, address 506, to be placed in the program counter 
and the stack is poped. The identical action occurs for the RTS 
commands at address 507 and 92 such that control is eventually 
returned to the main program and the stack Is empty. 


The LIFO or subroutine stack in the program control hardware is 
shown in Figure 18. When the call from subroutine command ts 
decoded by the computer control unit, the pipeline register out- 
puts cause the stack control to accept the output of the program 
counter register and place it at the top of the stack. Next the 
subroutine address ts brought in from the memory passed 
through the multiplexer and placed in the MAR. The subroutine 
address is also brought through the multiplexer incrementer, 
through the incrementer and placed in the program counter reg- 
ister to be used as a possible next source of address. The sub- 
routine return address is recovered from the stack when the 
pipeline register instructs the stack control logic to place the 
return address at the multiplexer. The return address is passed 
through the multiplexer and clocked into the MAR. The return 
address is also clocked into the PC register via the incrementer 
multiplexer and the incrementer, for use as the next sequential 
address. Figure 19 shows the jump to subroutine instruction and 
Figure 20 shows the microcycles that are used in a typical call to 
subroutine command using the program control hardware shown 
in Figure 18. At TO the program counter is placed into the MAR 
and updated. Time T1 finds the MAR accessing the subroutine 
call instruction, with the instruction being placed into the instruc- 
tion register. At T2 the opcode is decoded by the CCU, and the 
first instruction microcode bits are clocked into the pipeline reg- 
ister. At time T3, the PC is placed in the MAR. At T4 the starting 
address of the subroutine is being fetched and placed into the 
MAR; the stack pointer ts incremented; the current program 
counter is placed on the LIFO stack; and the starting address of 
the Subroutine plus one ts placed into the program counter. 


Figure 21 details the microcycle timing for a return-from-sub- 
routine execution. At time zero the current program counter Is 
placed into the MAR, then incremented by one. During time one 
the contents of the MAR fetches the return from subroutine com- 
mand, which ts then clocked into the instruction register at the end 
of the microcycle. At time 2 the contents of the instruction register 
Is decoded in the CCU with the control bits being clocked into the 
pipeline register. During time 3 the return address on the top of 
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Figure 17. Nested Subroutine Example. 
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Figure 18. Subroutine Stack Architecture. 
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Figure 19. Jump to Subroutine (Branch and Stack) Instruction. 













Microinstruction Microcycle Time 
Operation TO | T1 T4 | T5 | T6 | T7 | T8 | T9 |] 10) Tit} T12 


PC — MAR; PC + 1 > PC 

Fetch Inst to IR 

Decode 

PC + MAR; PC + 1 >PC 
he + MAR; PC > STACK 

MEM + 1 —-PC;SP + 1—-SP 


Figure 20. Branch and Stack Instruction Microcode. 









Microinstruction . Microcycle Time 
Operation TO | T1 | T2 | T3 | T4 | T5 | T6 | T7 | TE | T9 | T10; T11;) T12 


PC +MAR; PC + 1 >PC x 

Fetch Inst to IR xX 

Decode X 
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SP -1->SP 


Figure 21. Return from Subroutine instruction Microcode. 











the LIFO stack is placed into the MAR, while that value plus one is 
stored into program counter. The stack pointer is then 
decremented. 


The basic program control hardware thus developed with some 
embellishments added are contained within the Am2930 program 
control unit as shown in Figure 22. The Am2930 is a 4-bit slice of 
the program control unit. It therefore easily allows the address 
bus to be virtually independent of the data bus in terms of width. 
The Am2930 has a general purpose auxiliary register which has 
two sources and two destinations. One source being the D inputs 
which flow through the R multiplexer and hence into the auxiliary 
register and the other source being the output of the full adder 
which is the second input to the R multiplexer. The two outputs of 
the auxiliary register go to the A and B multiplexers which in turn 
source the A and B inputs to the full adder. The register enable pin 
(RE) allows the auxiliary register to be unconditionally loaded 
from the D Inputs of the Am2930. The A multiplexer selects as its 
sources a logical zero, the output of the auxiliary register, or the D 
inputs. The B multiplexer accepts the outputs of the auxiliary 
register, a logical zero, the output of the subroutine stack file, or 
the output of the program counter register as its sources. 


In the Am2930 design the LIFO stack is 17 words deep, allowing 
up to seventeen levels of subroutine. The LIFO stack is controlled 
by the stack pointer logic which gives a FULL indication when the 
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stack is full and an EMPTY indication when the stack has 
emptied. The input to the LIFO stack is fed through a stack 
multiplexer whose inputs may be D inputs or the output of the 
program counter. Thus, depending upon the application, the 
stack may be used as either a subroutine stack or a general 
purpose LIFO stack which resides on the D bus. The incrementer 
and the full adder are controlled by the Ci and Cn carry-in bits 
respectively. Figure 23 details the ripple carry connections be- 
tween Am2930s in a 16-bit array. The Ci input of the least signifi- 
cant slice (LSS) is controlled from the pipeline register. 


The Ci signal is internally propagated through the incrementer of 
each device using carry look ahead logic. The microprogram 
memory, using the Ci input may now cause the Am2930s to 
repeatedly access the same main memory instruction if so de- 
sired. The full adder has its Cn input tied to ground for the LSS 
device of the Am2930 array. The Cn signal is progagated in 
parallel through the Am2930s. 


For a faster propagation of the Cn signal the interconnection 
shown in Figure 24 should be employed. The generate and 
propagate pins (G, P) of the Am2902A carry look ahead 
generator. The look ahead carries (Cn + x, y, z) are connected to 
the Cn inputs of their respective devices. The output of the 
Am2930 is three-state and is controlled by the output enable pin 
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Figure 22. Am2930 Block Diagram. 
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Figure 23. Ripple Expansion Scheme for Am2930’s. 
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Figure 24. Parallel Look-Ahead Expansion Scheme for Am2930’s. 


(OE). Other features of the Am2930 include an Instruction Enable 
pin (IEN). This pin allows the Am2930 array to be taken off of the 
microprogram data bus thus allowing the bits that were formerly 
committed to the Am2930 to be used in conjunction with other 
devices. The Am2930 also includes a condition code input (CC). 
The Condition Code input permits the conditional testing of a 
single bit. This allows the feasibility of such techniques as condi- 
tional branching at the macroprogram level. For more detailed 
explanation of the Am2930, its instructions and its applications, 
see the Am2930 Data Sheet. Figure 25 shows a typical system 
interconnection using the Am2930. The instruction lines, Ci, RE 
and the OE control pins are connected directly to the outputs of 
the combination microprogram memory and pipeline registers 
contained in the Am24775 devices. The condition code inputs are 
obtained from the Am2904 status and control device, thus allow- 
ing conditional jumps on status. Status from the Am2904 is also 


fed into the test mux for use by the Am2910 for its conditional 
code input. Likewise the full and empty indications from the 
Am2930 are fed into the test MUX for use by the Am2910 to 
ascertain the current status of the stack. If the stack is full and the 
user wishes to push the data onto the stack then the current data 
must be emptied from the stack under microprogram control, 
using additional hardware. 


Another feature of the Am2930 Program Control Unit as shown in 
Figure 22 is the full adder between the program counter and Y 
outputs. This allows for the execution of PC relative addressing 
types of instructions. While this can be an effective addressing 
scheme, it will not be covered in detail in this application note. 


While the Am2930 offers advantages in small high performance 
systems requiring a small LIFO stack, it is not intended to be the 
solution for all program counter requirements. 
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Figure 25. System Interconnection Using the Am2930. 


Using the Am2901A as a Program Control Unit 


Up to this point, the discussion has concerned a general ar- 
chitecture which includes 16 general registers in the ALU section 
and the LIFO stack is a program control section as shown in 
Figure 18. An alternative architecture and that used by most 
general purpose machines, is to place the LIFO stack in main 
memory. The stack pointer for the main memory LIFO stack can 
be contained in the program control unit to be described in this 
section. If the program control unit is built using Am2901A’s it now 
has the capability of using its internal registers as the program 
counter, stack pointer, upper stack bound pointer, lower stack 
bound pointer, and internal temporary registers. This of course 
provides considerable flexibility in the architecture and also al- 
lows for a much greater repertoire of instructions to be executed. 
Particularly, several stack instructions can be included in the 
instruction set, most of which will use the form of the register-to- 
indexed-memory instruction format as shown in Figure 1. 


Another advantage of the architecture shown in Figure 25 is 
speed. The Am2901A’s slightly surpass the Am2903 in speed. 


Thus, a 16-bit Am2901A program control unit architecture can be 
implemented and it will perform well within the microcycle times 
budgeted for the system. 


Looking at Figure 26 which shows the Am2901A used as a 
program control unit and the Am2903 used for the general regis- 
ter stacks/ALU section, we see a three-state buffer on the Y 
outputs of the Am2903 connected to the data bus as well as a 
three-state buffer at the input of the Am2903’s from the data bus. 
This provides isolation and buffering for the bus as well as allow- 
ing appropriate disconnects so that certain microcycles can be 
combined to improve the overall performance of the machine. In 
addition a transfer register is used between the Am2903’s and 
Am2901s to allow a microcycle to be terminated if an ALU opera- 
tion is taking place within the Am2903’s. This provides higher 
performance operation for the machine. In addition, a bi-direc- 
tional buffer (such as the Am8304B) is used between the 
Am2901A Y-outputs and the Am2903 Y-outputs. This gives the 
ability to push the program counter contained in the Am2901A on 
the stack for interrupt handling. In addition, values coming from 
the Am2903 can be placed in the memory address register. 
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Figure 26. PCU Architecture Using the Am2901A. 


Summary 


The thrust of this discussion has been aimed at defining and 
implementing hardware to accomplish addressing of main mem- 
ory. We have shown that a speed advantage is realized if the 
program counter is kept separate from the main general purpose 
register stack/ALU hardware. The most general purpose program 
control unit is the Am2901A. It offers several advantages in terms 
of program control, stack pointer control, and stack pointer 
boundary conditions. The Am2930 can be used in program con- 
trol units occupying less space and including a built-in stack, but 


has some speed and performance limitations. Both devices can 
be used to implement the basic addressing modes associated 
with the instructions described in this application note. 


Another purpose of this application note is to set the stage for 
Chapter 9 where we will overlap machine instructions such that 
register to register instructions can be executed in a single 200ns 
microcycle and the memory reference instructions can be exe- 
cuted in 600ns (3 microcycles) as the effective execution time. 
Also, we will expand on the use of the Am2901A as a Program 
Control Unit. 
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Chapter VI 
Interrupt 


INTRODUCTION 


A digital computer can be viewed as a finite state machine that 
moves from state to state via the execution of a program. Inter- 
rupt mechanisms provide a well-defined way of altering the flow 
of states in response to outside asynchronous events (inter- 
rupts). There is a wide variety of ways of handling interrupts 
depending upon the system requirements. The choice of a par- 
ticular interrupt mechanism can have a large impact on the 
through-put and flexibility of asystem. Therefore, time should be 
spent carefully defining the interrupt mechanism of a new com- 
puter design. 


POLLING VS. NON-POLLING 


One of the simplest ways to handle asynchronous events is the 
polling method. With each possible event there is an associated 
flag that can be accessed by the program. The processor then 
interrogates each flag in order to determine if service is required. 
This method trades simple hardware for software. This not only 
uses memory space but also uses time for polling the flags when 
no service is required. The polling method has low system 
through-put, high real time overhead and slow response time. 


In non-polling systems, the asynchronous event generates an 
interrupt request signal which is passed to the processor. The 
processor in turn suspends the execution of the current process 
and starts execution of an interrupt service routine. When the 
interrupt routine is completed, the processor resumes execution 
of the suspended process. This system is called an interrupt 
driven system because it executes interrupt service routines that 
are initiated by interrupt requests. 


Although the non-polling method requires more hardware, it has 
many advantages. Because the execution of interrupt service 
routines is transparent to the current process, less thought and 
time is required of the programmer of the current process. The 
response time is faster because no time is spent interrogating the 
other non-active interrupts, which in turn increases the system 
throughput. There is less real time overhead and less memory 
space required because only the service routine exists in memory 
and no polling routine is required. 


MACHINE VS. MICROPROGRAM LEVEL INTERRUPTS 


There are two levels on which interrupts may be handled. The 
first and most common is the machine level interrupt. In this 
method possible interrupt requests are checked for during the 
machine instruction fetch cycle. This guarantees that an inter- 
rupt can only happen when a machine instruction is complete 
and before a new instruction starts. 


The second level of handling interrupts is on the microprogram 
level. In the machine levei interrupt system, the microprogram 
has complete control of when to recognize an interrupt but in the 
micrcprogram level system the microprogram can be interrupted 
at any time. This method has a smaller response time for ser- 
vicing interrupt requests but requires that restrictions may be 
placed on the microprogram and the interrupt mechanism. 
These restrictions come from setting aside space on the finite 


microprogram stack in the sequencer for possible interrupt re-_ 


quests. Special consideration may also have to be given to loop 
counters. 


TYPES OF INTERRUPTS 


There are basically four types of interrupts based on the re- 
lationship of the source of the interrupt to the processor: within 
the processor, within the system, between software, and be- 
tween processors. A multiprocessor has to be able to handle all 
four levels of interrupts. Therefore, the interrupt structure that is 
picked will have these design tradeoffs to consider. 


A. Intraprocessor interrupts are those asynchronous events 
that happen within the processor during the execution of a 
machine instruction. This group includes such things as zero 
divide, overflow, accessing restricted memory, execution of 
a privileged instruction, machine failure, etc. 


B. Intrasystem interrupts are interrupts created by system 
peripherals such as disks, CRT’s and printers that require 
service. 


C. Executive interrupts are those interrupts caused by the cur- 
rent program that is executing. This provides a way for the 
current program to make a request of the executive (operat- 
ing system) program. These requests might include such 
things as starting new tasks, allocating hardware resources 
(disks, line printers), communication with other tasks, etc. A 
good example would be the supervisor call (SVC) in the IBM 
360/370 computers. 


D. Interprocessor interrupts include those interrupts between 
two intelligent processors. For example, this class of inter- 
rupts would be used to initiate data and status transfer be- 
tween a local processor and a processor at a remote site. 


SEQUENCE OF EVENTS FOR INTERRUPT HANDLING 


When an interrupt occurs there is a sequence of six events that 
happen. These events, which can be implemented in microcode 
or machine code, integrated together with the hardware com- 
prise the interrupt mechanism. The sequence of events de- 
scribes the steps that occur to provide for a smooth transfer from 
the current process environment to an interrupt servicing envi- 
ronment and back again. The sequence ensures that the proces- 
sor status will be the same immediately after an interrupt is 
serviced as immediately before the interrupt occurred. The 
events listed in the next few paragraphs may differ in order or 
overlap depending upon the machine design and application. 


Interrupt Recognition 


This step consists of the recognition of an interrupt request by 
the processor via an interrupt request line. In this step the pro- 
cessor can determine which device made the request. The 
method that is used to determine which device to service is 
directly related to the interrupt structure of the machine. The 
different types of interrupt structures will be discussed in more 
detail below. 


Save Status 


The goal of this step is to make the interrupt sequence trans- 
parent to the interrupted process. Therefore, the processor saves 
a minimum set of flags and registers that may be changed by the 
interrupt service routine, so that after the service routine is 
finished they may be restored. 


The minimum set of flags and registers would be those which 
will be destroyed in the transfer of control from the current pro- 
cess to the interrupt service routine. It is then the responsibility 
of the service routine to save any other registers which it might 
change. The minimum set of flags and registers might include 
the Program Counter, Overflow Flag, Sign Flag, Interrupt Mask, 
etc. The minimum set also includes any register or flag that 
needs to be saved that the interrupt service routine cannot 
access. 


Interrupt Masking 


This step can overlap some of the other steps. For the first few 
steps of the sequence all interrupts are masked out so that no 
interrupt may occur before the processor status Is saved. The 
mask is then usually set to accept interrupts of higher priority. 
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Some machines allow the service routine to selectively enable 
or disable interrupts also. There may be different variations to 
this step depending upon the application. 


Interrupt Acknowledge 


At some point the processor must acknowledge the interrupt 
being serviced so that the interrupting device knows that it is free 
to continue its task. The processor can acknowledge several 
different ways. One of the ways is to have a line devoted to 
interrupt acknowledge. Another method relies upon the inter- 
rupting device recognizing an acknowledge when the cause of 
the interrupt is serviced. 


Some processor designs also use this signal as a request for the 
interrupting device to send an |.D. down the data bus. This as- 
pect will be discussed in more detail below. 


Interrupt Service Routine 


At this point the processor can call the interrupt service routine. 
The address of the routine can be obtained several ways de- 
pending upon the system architecture. The most trivial is when 
there is only one routine which polls each device to find out 
which one interrupted. Some designs require that the interrupt- 
ing device put an address on the data bus so that the processor 
can store it in its program counter and branch to it. Other de- 
signs use an |.D. number derived from the priority of the interrupt 
and put it through a mapping PROM or look-up table in memory 
in order to obtain the address of the service routine. 


Restore and Return 


After the interrupt service routine has returned via some varia- 
tion of an Interrupt Return instruction, the processor should re- 
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store all the registers and flags that were saved previous to the 
interrupt routine. If this is done correctly, the processor should 
have the same status as before the interrupt was recognized. 


INTERRUPT STRUCTURES 


There are several interrupt structures that can be implemented. 
As usual there is a trade-off between hardware and software (or 
firmware). Listed below are some of the more common struc- 
tures used. The particular structures vary in the way that the 
processor determines which device made the interrupt request. 


Single Request, Multiple Poll 


In this structure there is one request line which is shared among 
all interrupting devices. When the processor recognizes an inter- 
rupt request it polls all the devices to find the interrupting device 
(see Figure 1). Priority is introduced via the order in which the 
devices are polled. This scheme also allows dynamic realloca- 
tion of priority. 


Single Request, Daisy Chain Acknowledge 


In this structure there is one request line which is shared. When 
the processor receives an interrupt it sends out a signal 
acknowledging the interrupt. The acknowledge signal is passed 
from I/O device to I/O device until the interrupting device re- 
ceives the signal. At this point the interrupting device identifies 
itself by putting an I.D. number on the data bus (see Figure 2). 
This structure requires less software, but has a static priority 
associated with each interrupting device. There is also a time 
delay associated with daisy chain acknowledge structure be- 
cause in each device INTA signal has to pass through several 
gate delays. 
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Figure 1. Single Request, Multiple Poll. 
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Figure 2. Single Request, Daisy Chain Acknowledge. 
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Multiple Request 


This structure features one line per priority level (see Figure 3). 
The multiple line structure gives the fastest response time since 
the interrupting device can be identified immediately. It also re- 
sults in simpler interfaces in the peripheral units, in general, a 
single interrupt request flip-flop. This structure allows for the 
possibility of having a mask bit associated with each priority 
level (device). The trade-off of this circuit is a wider bus and a 
limit of one peripheral per priority level. 


Multiple Request, Daisy Chain Acknowledge 


This structure combines the Single Request/Daisy Chain 
Acknowledge with the Muitiple Request structure (see Figure 4). 
For each interrupt request line there is an interrupt acknowledge 
line which is connected to a string of devices in a daisy chain 
fashion. When the appropriate device receives the interrupt 
acknowledge, It puts an 1.D. number on the data bus. 


The advantage of this structure is that a lot (more than available 
interrupt leveis) of devices may be handled by breaking them up 
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into short daisy chains. This gives a shorter access time than a 
pure daisy chain with less hardware than an interrupt request 
line per device. This advantage is that each device must be 
intelligent to pass on the acknowledge signal which requires 
more hardware in each device. 


PRIORITY SCHEMES 


When handling asynchronous requests one must assume that 
sometimes two or more requests can happen simultaneously. In 
order to handle this situation, there must be some sort of priority 
scheme implemented to pick which request is serviced first. 


The two most common priority schemes are the static and the 
rotating structures. In the static structure, all the interrupt levels 
are ordered from the lowest priority to the highest priority. This 
can be fixed in software or hardware and is usually permanent. 


In the rotating structure the possible interrupt requests are ar- 
ranged in a circle. There ts a pointer which points to the lowest 
priority interrupt. The priority of each interrupt increases as one 
travels around the circle, with the highest priority interrupt being 
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Figure 3. Multiple Request. 
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Figure 4. Multiple Requests, Daisy Chain Acknowledge. 
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adjacent to the lowest priority interrupt. The lowest priority inter- 
rupt pointer is changed to point at the interrupt that was just 
serviced. This structure is advantageous when all interrupts 
have similar priority and service bandwidth requirements. 


NESTING 


Nesting allows only higher priority interrupts to interrupt a pro- 
cessing interrupt service routine. Nesting requires fencing off 
equal and lower level interrupts. Fencing requires that the inter- 
rupt structure hold the value of the highest priority interrupt being 
serviced. This can be implemented with a Status Register that 
holds the value as a binary encoded number or in other systems 
as an In-Service Register with a different bit associated with 
each interrupt. 


Whether nesting is performed in microcode or not, all computers 
must have machine instructions to enable and disable interrupts 
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Figure 5. 


and set and clear mask bits. With these instructions, interrupt 
handlers can be written to accomplish nesting of interrupts al- 
though less efficiently than when done with microcode and 
hardware. In low-end computers, the interrupt structure only 
prioritizes interrupts leaving nesting to the software interrupt 
handlers. 


A UNIVERSAL HARDWARE INTERRUPT STRUCTURE 


While designing a hardware interrupt structure, the designer 
should consider the specific functions that are to be achieved. 
This provides for system optimization in not only hardware but 
also software. In the following paragraphs is a step by step 
development of a general purpose interrupt structure as related 
to the design concepts involved. 


Multiple Interrupt Request Handling 


Since interrupt requests are generated from a number of 
sources, the interrupt structures ability to handle interrupt re- 
quests from several sources is important. 


As implemented in Figure 5, the register configuration allows the 
hardware to handle interrupt requests from several sources. The 
first column of registers catches the asynchronous interrupt re- 
quest. The second column of registers synchronizes the re- 
quests with respect to the system. After the interrupt is serviced, 
one of the CLR lines can be used to selectively clear the inter- 
rupt request. 


Interrupt Request Prioritization 


Since the processor can service only one interrupt request at a 
time, the interrupt structure should have the ability to prioritize 
the requests and determine which has the highest priority. As 
shown in Figure 6, a priority encoder can be put on the output of 
the interrupt storage registers. The priority encoder will identify 
the highest interrupt request as a binary encoded number. 


Dynamic Interrupt Request Masking 


The ability to selectively inhibit or “mask” individual interrupt 
requests under program control is desirable. For example at 
times it may be important to inhibit all interrupts except Power 
Failure. As shown in Figure 7 this is realized by ANDing the 
output of a mask register with the output of the interrupt storage 
registers. Therefore, the mask register can be used to select 
which interrupt requests will pass through to the rest of the 
hardware. 


interrupt Request Clearing 


Flexibility in the method of clearing the interrupt allows different 
modes of interrupt system operation. Of particular value are the 
abilities to clear the interrupt currently being serviced or clear all 
interrupts. 
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This is implemented in Figure 8 by use of the Vector Hold reg- 
ister on the output of the Priority Encoder. This register holds the 
latest interrupt request that was recognized. Before another 
interrupt request is recognized, the output of the Vector Hold 
register can be fed through some clear control logic to selec- 
tively clear the old interrupt. 


Interrupt Request Priority Threshold 


The ability to establish a priority threshold is valuable. In this 
type of operation, only those interrupt requests which have 
higher priority than a specified threshold priority are accepted. 
The threshold priority can be defined by microprogram or can be 
automatically established by hardware at the interrupt currently 
being serviced plus one. This automatic threshold prevents mul- 
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This feature is implemented in Figure 8 using an incrementer 
and status register which is compared with the current request. 
Each time an interrupt is recognized, the status register is up- 
dated with one plus the current level. 


Interrupt Service Routine ‘“‘Nesting’”’ 


This feature allows an interrupt service routine for a given pri- 
ority request to be interrupted in turn by a higher priority interrupt 
request. This can be achieved by saving the status register be- 
fore each interrupt is serviced and restoring it afterwards. 


Microprogrammability and Hardware Modularity 


These last two design concepts bring us to the Vectored Priority 
Interrupt controller, the Am2914. The Am2914 is a modular inter- 
rupt system block which is beneficial in two ways. First, 
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Figure 9. Am2914 Block Diagram. 


hardware modularity provides expansion capability. Additional 
modules may be added as the need to service additional re- 
quests arises. Secondly, hardware modularity provides a struc- 
tural regularity which simplifies the system structure and also 
reduces the number of hardware part numbers. 


The Am2914 is microprogrammable, which permits the con- 
struction of a general purpose or “universal” interrupt structure 
which can be microprogrammed to meet a specific application’s 
requirement. The universality of the structure allows standardi- 
zation of the hardware and amortization of the hardware de- 
velopment costs across a much broader user base. The end 
result is a flexible, low cost interrupt structure as shown in 
Figure 9. 


PROGRAMMING THE Am2914 


The Am2914 is controlled by a four-bit microinstruction field Ip-l3. 
The microinstruction is executed if IE (Instruction Enable) is 
LOW and is ignored if IE is HIGH, allowing the four | bits to be 
shared with other functions. Sixteen different microinstructions 
are executed. Figure 11 shows the microinstructions and the 
microinstruction codes. 


In this microinstruction set, the Master Clear microinstruction is 
selected as binary zero so that during a power-up sequence, the 
microinstruction register in the microprogram control unit of the 
central processor can be cleared to all zeros. Thus, on the next 
clock cycle, the Am2914 will execute the Master Clear function. 
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Figure 10. Am2914 Logic Symbol. 
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Figure 11. Am2914 Microinstruction Set. 


This includes clearing the Interrupt Latches and Register as well 
as the Mask Register and Status Register. The LGE flip-flop of 
the least significant group is set LOW because the Group Ad- 
vance Receive input is tied LOW. All other Group Advance Re- 
ceive inputs are tied to Group Advance Send outputs and these 
are forced HIGH during this instruction. This clear instruction 
also sets the Interrupt Request Enable flip-flop so that a fully 
interrupt driven system can be easily initiated from any interrupt. 


The Clear All Interrupts microinstruction clears the Interrupt 
Latches and Register. 


The Clear Interrupts from M-Bus microinstruction clears those 
Interrupt Latches and Register bits which have corresponding 
M-Bus bits set equal to one. 


The Clear Interrupts from Mask Register microinstruction clears 
those Interrupt Latches and Register bits which have cor- 
responding Mask Register bits set equal to one. The M-Bus is 
used by the Am2914 during the execution of this microinstruction 
and must be floating. 


The Clear Interrupt, Last Vector Read microinstruction clears 
the Interrupt Latch and Register bit associated with the last 
vector read. 


The Read Vector microinstruction is used to read the vector 
value of the highest priority request causing the interrupt. The 
vector outputs are three-state drivers that are enabled onto the 
is instruction. This microinstruction also automatically loads the 
value “vector plus one” into the Status Register. In addition, this 
instruction sets the Vector Clear Enable flip-flop and loads the 
Current vector value into the Vector Hold Register so that this 
value can be used by the Clear Interrupt, Last Vector Read 
microinstruction. This allows the user to read the vector as- 
sociated with the interrupt, and at some later time clear the 
Interrupt Latch and Register bit associated with the vector read. 


During the Read Status Register microinstruction, the Status 
Register outputs are enabled onto the Status Bus (S9-S2). The 
Status Bus is a three-bit, bi-directional, three-state bus. 


The Read Mask Register microinstruction enables the Mask 
Register outputs onto the bi-directional, three-state M-Bus. 


The Set Mask Register microinstruction sets all the bits in 
the Mask Register to one. This results in all interrupts being 
inhibited. 


The Load Status Register microinstruction loads S-Bus data into 
the Status Register and also loads the LGE flip-flop from the 
Group Enable input. 


The Bit Clear Mask Register microinstruction may be used to 
selectively clear individual Mask Register bits. This micro- 
instruction clears those Mask Register bits which have cor- 
responding M-Bus bits equal to one. Mask Register bits with 
corresponding M-Bus bits equal to zero are not affected. 


The Bit Set Mask Register microinstruction sets those Mask 
Register bits which have corresponding M-Bus bits equal to one. 
Other Mask Register bits are not affected. 


The entire Mask Register Is cleared by the Clear Mask Register 
microinstruction. This enables all interrupts subject to the Inter- 
rupt Enable flip-flop and the Status Register. 


All Interrupt Requests may be disabled by execution of the Dis- 
able Interrupt Request microinstruction. This microinstruction 
resets an Interrupt Request Enable flip-flop on the chip. 


The Load Mask Register microinstruction loads data from the 
three-state, bi-directional M-Bus into the Mask Register. 


The Enable Interrupt Request microinstruction sets the Interrupt 
Enable flip-flop. Thus, Interrupt Requests are enabled subject to 
the contents of the Mask and Status Registers. 


Am2914 BLOCK DIAGRAM DESCRIPTION 


The Am2914 block diagram is shown in Figure 9. The Micro- 
instruction Decode circuitry decodes the Interrupt Microinstruc- 
tions and generates required control signals for the chip. 


The Interrupt Register holds the Interrupt Inputs and Is an 
eight-bit, edge-triggered register which 1s set on the rising edge 
of the CP Clock signal if the Interrupt Input is LOW. 


The Interrupt latches are set/reset latches. When the Latch 
Bypass signal is LOW, the latches are enabled and act as nega- 
tive pulse catchers on the inputs to the Interrupt Register. When 
the Latch Bypass signal is HIGH, the Interrupt latches are 
transparent. 


The Mask Register holds the eight mask bits associated with the 
eight interrupt levels. The register may be loaded from or read to 
the M-Bus. Also, the entire register or individual mask bits may 
be set or cleared. 


The Interrupt Detect circuitry detects the presence of any un- 
masked Interrupt Input. The eight-input Priority Encoder deter- 
mines the highest priority, non-masked Interrupt Input and forms 
a binary coded interrupt vector. Following a Vector Read, the 
three-bit Vector Hold Register holds the binary coded inter- 
rupt vector. This stored vector can be used later for clearing 
interrupts. 


The three-bit Status Register holds the status bits and may be 
loaded from or read to the S-Bus. During a Vector Read, the 
Incrementer increments the interrupt vector by one, and the re- 
sult is clocked into the Status Register. Thus, the Status Reg- 
ister points to a level one greater than the vector just read. 
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The three-bit Comparator compares the Interrupt Vector with the 
contents of the Status Register and indicates if the Interrupt 
Vector Is greater than or equal to the contents of the Status 
Register. 


The Lowest Group Enabled Flip-Flop is used when a number of 
Am2914’s are cascaded. In a cascaded system, only one Low- 
est Group Enabled Flip-Flop is LOW at a time. It indicates the 
eight interrupt group, which contains the lowest priority interrupt 
level which will be accepted and is used to form the higher order 
status bits. 


The Interrupt Request and Group Enable logic contain various 
gating to generate the Interrupt Request, Parallel Disable, Rip- 
ple Disable, and Group Advance Send signals. 


The Status Overflow signal is used to disable all interrupts. It 
indicates the highest priority interrupt vector has been read and 
the Status Register has overflowed. 


The Clear Control logic generates the eight individual clear sig- 
nals for the bits in the Interrupt Latches and Register. The Vector 
Clear Enable Flip-Flop indicates if the last vector read was from 
this chip. When it is set it enables the Clear Control Logic. 


The CP clock signal is used to clock the Interrupt Register, Mask 
Register, Status Register, Vector Hold Register, and the Lowest 
Group Enabled, Vector Clear Enable and Status Overflow Flip- 
Flops, all on the clock LOW-to-HIGH transition. 


CASCADING THE Am2914 


A number of input/output signals are provided for cascading the 
Am2914 Vectored Priority Interrupt Encoder. A definition of 
these I/O signals and their required connections follows: 


Group Signal (GS) — This signal is the output of the Lowest 
Group Enabled flip-flop and during a Read Status micro- 
instruction is used to generate the high order bits of the Status 
word. 


Group Enable (GE) — This signal is one of the inputs to the 
Lowest Group Enable flip-flop and is used to load the flip-flop 
during the Load Status microinstruction. 


Group Advance Send (GAS) — During a Read Vector micro- 
instruction, this output signal is LOW when the highest priority 
vector (vector seven) of the group is being read. In a cascaded 
system Group Advance Send must be tied to the Group Ad- 
vance Receive input of the next higher group in order to transfer 
status information. 


Group Advance Receive (GAR) — During a Master Clear or 
Read Vector microinstruction, this input signal is used with other 
internal signals to load the Lowest Group Enabled flip-flop. The 
Group Advance Receive input of the lowest priority group must 
be tied to ground. 


Status Overflow (SV) — This output signal becomes LOW after 
the highest priority vector (vector seven) of the group has been 
read and indicates the Status Register has overflowed. It stays 
LOW until a Master Clear or Load Status microinstruction is 
executed. The Status Overflow output of the highest priority 
group should be connected to the Interrupt Disable input of the 
same group and serves to disable all interrupts unti| new status 
is loaded or the system is master cleared. The Status Overflow 
outputs of lower priority groups should be left open (see Fig- 
ure 14). 


Interrupt Disable (ID) — When LOW, this input signal inhibits the 
Interrupt Request output from the chip and also generates a 
Ripple Disable output. 


Ripple Disable (RD) — This output signal is used only in the 
Ripple Cascade Mode (see below). The Ripple Disable output is 
LOW when the Interrupt Disable input is LOW, the Lowest 
Group Enabled flip-flop is LOW, or an Interrupt Request is gen- 
erated in the group. In the ripple cascade mode, the Ripple 
Disable output ts tied to the Interrupt Disable input of the next 
lower priority group (see Figure 13). 


Parallel Disable (PD) — This output is used only in the parallel 
cascade mode (see below). It is LOW when the Lowest Group 
Enabled flip-flop is LOW or an Interrupt Request is generated in 
the group. It is not affected by the Interrupt Disable input. 


CASCADING CONFIGURATIONS 


A single Am2914 chip may be used to prioritize and encode up to 
eight interrupt inputs. Figure 12 shows how the above cascade 
lines should be connected in such a single chip system. 
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Figure 12. Cascade Lines Connection for 
Single Chip System. 


The Group Advance Receive and Group Enable inputs should 
be connected to ground so that the Lowest Group Enabled flip- 
flop is forced LOW during a Master Clear or Load Status micro- 
instruction. Status Overflow should be connected to Interrupt 
Disable in order to disable interrupts when vector seven Is read. 
The Group Advance Send, Ripple Disable, Group Signal and 
Parallel Disable pins should be left open. 


The Am2914 may be cascaded in either a Ripple Cascade Mode 
or a Parallel Cascade Mode. In the Ripple Cascade Mode, the 
Interrupt Disable signal, which disables lower priority interrupts, 
is allowed to ripple through lower priority groups. Figures 13, 16, 
and 17 show the cascade connections required for a ripple cas- 
cade 32 input interrupt system. 


In the parallel cascade mode, a parallel lookahead scheme is 
employed using the high-speed Am2902 Lookahead Carry 
Generator. Figures 14, 15, and 17 show the cascade‘connections 
required for a parallel cascade 32-input interrupt system. For this 
application, the Am2902 1s used as a lookahead interrupt disable 


215 


generator. A Parallel Disable output from any group results in the 
disabling of all lower priority groups in parallel. Figure 15 shows 


° the Am2902 logic diagram and equations. 
STATUS OVERFLOW 
INTERRUPT {0 In Figures 16 and 17 the Am2913 Priority Interrupt Expander is 


INTERRUPT 
REE nite ete eg shown forming the high order bits of the vector and status, re- 
aoe spectively. The Am2913 is an eight-line to three-line priority en- 
DISABLE coder with three-state outputs which are enabled by the five 
output control signals G1, G2, G3, G4, and G5. In Figure 16, the 
Am2913 is connected so that its outputs are enabled during a 
Read Vector instruction, and in Figure 17 the Am2913 is con- 
ere nected to microinstruction bits so that its outputs are enabled 
Am2914 ’ during a Read Status [nstruction. The Am2913 logic diagram and 

truth table are shown in Figure 18. 


CLOCK 


The Am25LS138 three-line to eight-line Decoder also is shown in 
Figure 17. It is used to decode the three high order status bits 
during a Load Status instruction. The Am25LS138 logic diagram 


and truth table are shown in Figure 19. 
INTERRUPT IRQ 8-15 
REQUEST Am2914 
OUTPUT 


Am2914 IN THE Am2900 SYSTEM 


The block diagram of Figure 20 shows a typical 16-bit mini- 
computer architecture. The Am2914 Is the heart of the Interrupt 
Control Unit as shown at the bottom of the block diagram. It 
receives its microinstructions from the Computer Control Unit. 
The mask, Status and Interrupt vector information are passed on 
the data bus. The interrupt request line from the Am2914 input 
into the next microprogram Address Control unit where it can be 
tested to determine if an interrupt request has been made. 


Am2914 





Figure 13. Interrupt Disable Connections for Figures 21 and 22 show the detailed hardware design of two 
Ripple Cascade Mode. example interrupt control units (ICU’s) for an Am2900 Computer 
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Figure 14. Interrupt Disable Connections for Parallel Cascade Mode. 
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Ch+x = Go + PoCp 


Cr+y 7 G, + P1Go + P,P9C, 
Cn+z > Go + P2G, + PoP,Go + P5P;PoC, 
= G3 + P3Go + P3P5G, + P3P5P;Go 


= P3P2P,Pp 





Figure 15. Am2902 Carry Look-Ahead Generator Logic Diagram and Equations. 
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Figure 16. Vector Connections for both the Parallel and Ripple Cascade Modes. 


System. Figure 21 shows an eight interrupt level ICU, and Figure 
22 shows an ICU which has sixteen levels. In both designs, the 
Am2914 Instruction inputs and Instruction Enable input are driven 
by the lo.3 field and IE bit, respectively, of the Microinstruction 
Register. Note that Am2914 Instruction inputs are enabled only 
when the IE bit is LOW. Therefore, the Ip.3 field of the Micro- 
instruction Register may be shared with another functional unit 
of the computer such as the ALU. 


The Latch Bypass input is shown connected to ground so that a 
Low-going pulse will be detected at any of the Interrupt Inputs. 
The designer has the option of connecting the Latch Bypass input 
to a pull up resistor connected to +5 volts. This makes the inputs 
low level sensitive. They are clocked in by each system clock. Its 
therefore implied that the processor will have to acknowledge the 
interrupt so that the interrupting device will know when to release 
the interrupt request line. 
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Figure 17. Group Signal, Group Enable, Group Advance Send, Group Advance Receive and Status Connections for Both 
the Parallel and Ripple Cascade Modes. 
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Figure 18. Am2913 Priority Interrupt Expander Logic Diagram and Truth Table. 
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Figure 20. A Generalized Computer Architecture. 
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Figure 21. 8 Level Interrupt Control Unit for Am2900 System. 
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Figure 22. 16 Level Interrupt Control Unit for Am2900 System. 
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In Figures 21 and 22, the Status and Mask inputs/outputs are 
connected to the data bus in a bi-directional configuration so that 
Status and Mask Registers may be loaded from or read to the 
data bus with appropriate Am2914 instructions. This gives the 
designer two possibilities which could be very advantageous. 


Number one is the ability to store the Status and Mask information 
on a stack in memory. This is very advantageous when doing 
nested interrupts. Secondly, it allows the designer to construct 
machine instruction that can modify these two registers. This is 
very Important to the system programmer who is involved in 
writing software to manage the interrupts. 


For the eight level ICU of Figure 21, the Status Overflow output is 
connected to the Interrupt Disable input, and the Group Advance 
Receive and Group Enable inputs are connected to ground, as 
previously described. 


For the 16 interrupt level ICU of Figure 22, the Parallel Disable 
output of the higher priority group serves as the high order vector 
bit. An Am2913 Priority Interrupt Expander is gated by the 
Am2914 instruction lines so that its output is enabled only during a 
Read Status instruction, and is used to encode the high order bit 
of the status. An inverter suffices to decode the high order bit of 
the status bit during a Load Status instruction. As described 
previously for a ripple cascade system, the Group Advance Re- 
ceive input of the next higher priority group; the Ripple Disable 
output is connected to the Interrupt Disable input of the next lower 
priority group; the Status Overflow output of the highest priority 
group is connected to the Interrupt Disable input of the same 
group, and the Group Advance Receive input of the lowest priority 
group is connected to ground. 


In both designs, two Am29751 32-word by 8-bit PROM’s with 
three-state outputs are used to map the Am2914 Vector outputs 
into a 16-bit address vector. The PROM outputs are connected to 
the data bus. When a Read Vector Instruction (Am2914) is exe- 
cuted, the address vector is available to be used either as the 
address of the next instruction or a location to find the address of 
the next instruction to execute. 


Figure 23 shows a design where the address vector from the 
mapping PROM can be clocked into a register in the Am2903’s. 
The registers in the Am2903’s would be split between general 
purpose, scratch, stack pointers and Program Counter registers. 


The address vector also may be gated directly to the “D” inputs of 
the Am2911 Microprogram Sequencer as shown in Figure 24, 
and used as the start PROM address of a microinstruction inter- 
rupt service routine. This method would be most useful in a 
controller application. This method would trade faster service for 
a bigger microprogram that accommodates all the code to service 
each individual interrupt. 


FIRMWARE EXAMPLE FOR Am2914 INTERRUPT SYSTEM 


The software for handling interrupt requests is on two levels. 
The first level to come into play is the microprogram level. This is 
the level at which the request is recognized and the program 
counter is manipulated to start execution of a machine level 
interrupt service routine which is the second level. When the 
machine level interrupt service routine is finished, some form of 
a Return Interrupt instruction is executed. The microcode for the 
return instruction manipulates the program counter so that 
execution of the current machine program previous to the re- 
quest is restored as shown in Figure 25. 


This example is concerned with the microprogram level. This 
microcode goes along with the hardware shown in Figure 23. In 
this example the code is shown in the form of Flow Charts be- 


cause the actual microprogram format will vary from machine to 
machine. 


The important features to notice that have a direct relevance to 
the firmware are the Latch Bypass and where the Mask, Status 
and Vector busses go. For this example, the Latch Bypass is 
LOW making the Interrupt Latches latch up on a negative going 
pulse. The Mask and Status busses go to the data bus allowing 
the Status and Mask data to be transferred to and from memory. 
The Vector bus passes through a mapping PROM to the data 
bus where it can be read into the Program Counter contained in 
the Am2903’s. The PROM contains addresses of service 
routines which correspond to the different interrupt levels. 


Another relevant fact, important to understanding the firmware is 
that the interrupt mechanism is limited to handle interrupts on 
the machine level. 


As shown in Figure 26a, the first thing that happens in the fetch 
routine (written in microcode) is a conditional subroutine call that 
will be taken if an interrupt request is present. This happens 
before the current machine instruction is fetched and the pro- 
gram counter is incremented. 


In the Interrupt routine (shown in Figure 26b) a microprogram 
subroutine is first called to push the program counter onto the 
system stack. This is done so that the program counter can be 
restored in order to resume execution of the machine program 
after the interrupt service routine is done. The next thing that is 
saved on the system stack is the contents of the Am2914 Status 
Register. This is done because the status register which contains 
the priority level that would be serviced prior to the interrupt, will 
be restored after the interrupt is serviced. This maintains anested 
interrupt structure (fence). 


After saving the program counter and status register, the vector 
is read out of the Am2914 through the mapping PROM to obtain 
the address of the machine interrupt service routine. The ad- 
dress is then read into the program counter which resides in the 
Am2903’s. When the Vector is read, the interrupt request priority 
plus one is automatically put into the status register by the 
Am2914 so that all interrupt requests of lower priority than the 
one being serviced are ignored. This is often referred to as 
moving the fence up. Since the vector has been read and the 
new address is in the program counter, the interrupt request can 
be cleared from the interrupt register via the Clear Interrupt/Last 
Vector Read instruction. At this point a jump is made to the 
Fetch routine which will now fetch the first instruction of the 
machine Interrupt Service routine. 


The last instruction that the machine level interrupt service exe- 
cutes is an Interrupt Return. This will in turn call Return Interrupt 
microprogram. The status is first popped off the system stack 
and loaded back into the status register. This restores the Inter- 
rupt Fence. The program counter is then popped off the system 
stack and loaded into the program counter register. This re- 
stores the program counter to point to the instruction that was 
going to be executed when the interrupt request occurred. 


TIME DELAY WHEN USING THE Am2914 


An aspect that should be covered when using any part is how it 
will fit into the system timing; because the cycle time of the 
system will be as long as the longest delay path in the machine. 
Shown in Figure 27 is the longest delay path through the 
Am2914 for the previous 16-bit computer example. The calcula- 
tions were using both typical and worst case values at 25°C and 
5.0V. 


The longest delay path for the system where the vector from the 
mapping PROM feeds into the “D” inputs of the Am2910 is 
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Figure 23. Example of a 16-Bit Computer #1. 
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Figure 25. Machine Level Instruction Flow During Interrupt 
Request. 
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LOAD MAR —— PC 
JSB MEMREAD 
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Figure 26a. Flow Chart for a Simplified Microprogram Fetch 
Routine. 
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JMP FETCH 


Figure 26b. Call Interrupt Service Routine Microprogram 
Flow Chart. 





JSB POP 
LOAD STATUS 


JSB POPPC 


JMP FETCH 





Figure 26c. Return Interrupt Microprogram Flow Chart. 
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Figure 27a. AC Calculations. 
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Figure 28h. 


shown tn Figure 28. This path is much longer because of the two 
PROM’s that have to be accessed. Therefore, there may be a 
trade-off of slightly longer system cycle time for faster service of 
interrupts via service routines in microcode. 


For some systems the delay time shown in Figure 28b may be 
too long. Therefore, the designer can split the delay time into 
parts by putting a register between the Am2914 and the mapping 
PROM as shown in Figure 28c. When done In two system clock 
cycles, the delay time will be as shown in Figure 28f. 


Figure 28d shows the delay path from the Interrupt Request 
Register through the Condition Code MUX to the Am2910. The 
time calculations are shown in Figure 28g. Again, for some sys- 
tems, this path may be too long. Therefore, as shown above, this 
path may be broken tn two, which is shown in Figure 28e. This 
will result in two system clock cycles. The delay involved in each 
cycle is shown in Figure 28h. 


ANOTHER EXAMPLE OF Am2900 SYSTEM 
USING THE Am2914 


As shown in Figure 29, this example varies in the way that the 
interrupt request 1s recognized by the microprogrammed 


machine. In this example the interrupt request line for the 
Am2914 enables or disables the MAP signal going to the map- 
ping PROM. When an interrupt request is present and a Jump 
Map instruction is executed, the output of the mapping PROM 
remains tri-stated; and the bus connected to the “D” inputs of 
the Am2910 is HIGH because of the pull-up resistors. Therefore, 
the microprogram will start executing at the highest location in 
microprogram memory when an interrupt request is present. At 
this location a Jump Instruction to the microprogram interrupt 
service routine could be placed. The microcode is written so that 
the only time a Jump Map instruction is executed is at the end of 
the Fetch microprogram routine as shown in Figure 30a. 


In the previous example the interrupt request was recognized 
before the program counter is incremented after which the Jump 
Map instruction is executed. When the Jump Map is executed, 
either the instruction is executed or an interrupt request Is ser- 
viced. Therefore, when the Return Interrupt machine instruction 
is executed, the program counter needs to be backed up via 
microcode, as shown in Figure 30b, in order to refetch the 
machine instruction which was lost. This also dictates that the 
program counter have a path to an incrementer/decrementer or 
ALU, which in this example is handled by putting the program 
counter in the Am2903’s. 


MICROPROGRAM LEVEL INTERRUPT EXAMPLE 


Some high-speed control applications require extremely fast 
interrupt response. While it may ordinarily be desirable to com- 
plete an entire processing sequence (such as executing a mi- 
croprogram for a macroinstruction) prior to testing for the inter- 
rupt and allowing it to occur, it is not always possible to achieve 
the required interrupt response time desired. If this is the case, 
microinstruction level interrupt handling must be employed. The 
technique described below has a maximum latency of three mi- 
crocycles which can be 450-600ns total. Implementation is 
straightforward using the Am2910 Microsequencer, a 40-pin LSI 
device that can control 4096 words of microprogram at a 150ns 
cycle time, and a few extra MSI and SSI packages. In this appli- 
cation, the Am2910 is configured in its standard architecture. 
The additional logic does not influence the normal system cycle 
time. 


If microlevel interrupt handling is to be employed, logic must be 
provided to generate a substitute microprogram address corre- 
sponding to the location of the interrupt service routine. In the 
event of a microlevel interrupt, the sequencer address outputs 
are tri-stated and the substitute address is placed on the micro- 
program address bus, causing the next microinstruction fetch to 
be determined by the interrupt control vector generator. While 
this is happening, steps must be taken with the Am2910 to in- 
sure that the interrupted routine can be properly restored. To 
understand this procedure, it will be necessary to examine the 
Am2910 in more detail. 


Referring to Figure 31, the microprogram address bus is driven 
by the Y outputs of the Am2910 through a tri-state buffer than 
can be disabled by means of the OE input. The address is 
selected in a multiplexer from a direct input, from a register/ 
counter, from a push/pop stack, or from a microprogram counter 
register. The microprogram counter register is commonly used 
as the address source when executing the next microinstruction 
in sequence. Whenever an address appears at the multiplexer 
outputs, it is incremented and presented to the microprogram 
counters inputs. At the rising edge of the clock, this new address 
that 1s current address-plus-1 1s loaded into the microprogram 
counter and a microprogram access begins at this address. 
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JSB POP 
LOAD STATUS 


JSB POPPC 
LOAD PC PC -1 
JMP FETCH 


Figure 30a. Return Interrupt Microprogram 
for Second Example. 
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Figure 30b. Fetch Microprogram for the Second Example. 
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Figure 31. Am2910 Block Diagram. 
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Note that at this time, whatever was fetched at the previous 
address was loaded into the microword register for execution. 
Thus, the microprogram sequencer is always looking for the 
address of the next microinstruction to be executed (while a 
previously fetched microinstruction is residing in the microword 
register). Subroutine and microprogram loops may be ac- 
complished by using the stack and the register counter. Re- 
gardiess of what is selected as source of next address, the 
selected address will be incremented and presented to the 
microprogram counter. So to accomplish a microprogram 
branch, one would simply select the D inputs for a branch ad- 
dress for one cycle, then the next address source could be 
switched back to the program counter on the next cycle which 
would then contain the branch address plus 1. 


This is a carry input to the incrementer which is normally tied 
HIGH. In the case of a microlevel interrupt, the microprogram 
sequencer will not determine the address of the next microin- 
struction to be executed. Instead the sequencer output will be 
tri-stated and a substitute address will be placed on the bus. The 
sequencer continues to operate in a normal fashion with its mul- 
tiplexer output being incremented and presented to the micro- 
program counter register. It must now be noted that the instruc- 
tion located at the address then coming out of the multiplexer 
outputs will not be executed but rather the next microinstruction 
to be executed will be determined by the interrupt vector 
generator. It would therefore, be wrong to increment this micro- 
program address but rather it must be saved intact in order to 
push it onto the stack for access during interrupt return. This is 
easily accomplished in the Am2910 by grounding the carry input 
to the incrementer simultaneously with three-stating the se- 
quencer output. Then the multiplexer output will be stored in the 


TEST 
CONDITIONS 


(OUTSIDE) 


SYNC AND 
[> CONTROL 
LOGIC 


SYSTEM 
CLOCK 


INTERRUPT 
CONTROL 
AND 
VECTOR 
GENERATOR 


microprogram counter register and on the next microcycle the 
Am2910 must be told to push in order to preserve this address 
on the stack. 


This carry-in input is all important and exists on all Advanced 
Micro Devices’ microprogram sequencers. Unless the carry-in is 
grounded, whatever address was in the multiplexer output when 
the sequencer output was tri-stated is incremented and an in- 
struction is missed in the interrupted routine. This, of course, 
would likely be disastrous. The key to this microinterrupt 
technique is that the address of the unexecuted instruction 
(when the Am2910 was tri-stated and a substitute address 
supplied) is preserved by inhibiting the increment via the carry 
input, so the address is passed on intact to the microprogram 
counter. If the microinterrupt is to be more than one cycle long, 
the microprogram counter must be pushed so as to save the 
return address. Otherwise, a “continue” may be used to return 
from the interrupt on the very next cycle. In this event the mic- 
rointerrupt effectively inserts one instruction in the stream. 


Figure 32 is the block diagram of a hardware design that imple- 
ments the above concept. The SYNC/CONTROL and INTER- 
RUPT CONTROL/VECTOR GENERATOR logic are shown in 
detail in Figure 33. Part of the Am2918 and both ‘LS74 Flip- 
Flops are used to synchronize the recognition of the asynchron- 
ous interrupt request as shown in Figure 34. The interrupt re- 
quest arrives at the interrupt input. On the next clock cycle it is 
clocked into the Am2918. In the following clock cycle a pulse 
that is one system clock cycle long is put out by the flip-flop pair 
FF1 and FF2. The pulse is used to disable the carry input of the 
Am2910, tri-state the output of the Am2910, and enable the 
jump vector onto the input of the PROM. The vector indexes into 
a table in microprogram memory that contains “JUMP SUB- 
ROUTINE” instructions to different interrupt service routines. 
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uWORD REGISTER < 


ZY 
PEEL: 


ead 


CONTROLS TO SYSTEM (OR OUTSIDE) 





Figure 32. Computer Control Unit Set-up for High-Speed Micro-Level Interrupt Handling. Latency is a Maximum 


of Two Microcycles (i.e., about 300 to 500ns). 
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Figure 33. Example of Sync Control Logic and Vector Generator. 
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Figure 34. Timing of Vector Generator and Sync Control Logic. 
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Figure 35. Interrupt Sequence Timing. 
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Figure 36. Return-From-interrupt Sequence Timing. 





Figure 35 shows how the interrupt sequence timing fits into the 
normal flow of microprogram address in the Am2910. Note how 
the stack is used. This demonstrates the need for always re- 
serving room on the stack to allow for interrupts. This applies to 
any room that the interrupt service routine may require as well 
as the return address. This limitation may require that only one 
interrupt request be serviced at a time. 


Figure 36 shows how the return from the interrupt service 
routine fits into the microprogram flow. Notice that a Return 
instruction is used to accomplish this. 


SUMMARY 


In this chapter, Interrupts were discussed beginning with a def- 
inition of the Interrupt Mechanism and proceeding to a classifi- 
cation of different interrupts and how they are handled. A dis- 


cussion of the concepts that go into designing the “Universal 
interrupt” hardware was given which culminated with the 
Am2914. The chapter ends with several Interrupt Mechanism 
applications using the Am2914 and Am2910. 


In this chapter it was shown how interrupts can be handled using 
parts from the Am2900 family. Because of their hardware mod- 
ularity and universal architecture, they may be used in a variety 
of applications. Since the Am2900 Family parts are micropro- 
grammable, they allow the user’s system to grow with time as 
system requirements change. Together these attributes make 
the Am2900 Family the flexible cost effective family that it is. 
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Introduction 


The transfer of data between the microcomputer and the 
peripheral devices is generally referred to as Input/Output (I/O). 
What is desired is a high speed technique of transferring data 
between the peripherals and the memory. Generally speaking, 
there is a minimum of three types of I/O. These are, Programmed 
(0, Memory Mapped I/O and Direct Memory Access 1/O. All of 
these schemes are common in today’s currently available 
minicomputers. A basic understanding of these I/O techniques is 
helpful in fully comprehending DMA. The first two of these types 
of I/O can be interrupt driven. Thatis, programmed I/O or memory 
mapped I/O can be initiated by an interrupt from the peripheral 
device. 


Programmed 1/O 


in this type of I/O, all operations are controlled by the CPU 
program. In other words, the peripheral device performs the 
functions of inputting or outputting data as it is controlled by the 
CPU. Normally, the machine will include a set of I/O instructions 
which are used to transfer data to or from the peripheral devices 
via an Input/Output port. All data for the peripheral devices 
passes through these I/O ports to the CPU and the resources of 
the CPU must be utilized in order to effect an I/O transfer. Figure 1 
shows the Block Diagram of a programmed I/O system used ina 
typical microcomputer. Figure 2 shows an example of that portion 
of the program used to output data to the peripheral device. 


CPU ; MEMORY 


VO PORT 


VO DEVICES 


MPR-550 





Figure 1. Programmed I/O System. 










CPU Program Comments 


Load R, M Load CPU Register R with the Contents of 
Memory Address M 
Out D, R Transfer the Contents of CPU Register R to 


i/O Device D via the I/O port. 


Figure 2. Example Output Program — Programmed |/O. 


Programmed I/O ts simple to implement and does not require the 
utilization of any memory addresses for its realization. In addition, 
special instructions are available to the programmer to execute 
the peripheral data transfers. Programmed I/O is also low cost 
relative to other types of I/O; however, it has the following disad- 
vantages. Since I/O device operation is asynchronous with re- 


spect to CPU operation, the CPU has no way of knowing when a 
peripheral device is ready to transfer data and must periodically 
poll the device to determine its readiness. This results in an 
inefficient I/O transfer. Also, since the CPU must be used to effect 
the I/O transfer, the CPU resources are tied up during the time of 
transfer and the time of polling and cannot be used for other tasks. 
For these reasons, Programmed I/O is generally limited to use 
with low speed devices. 


Perhaps, one of the best known programmed I/O microcom- 
puters in the industry today is the Am9080A. This device features 
two instructions for either inputting data or outputting data to any 
one of 256 Input/Output ports. 


Memory Mapped 1/O 


Memory Mapped 1/O is a technique whereby the transfer of data 
to and from peripheral devices is accomplished by using some of 
the normally available memory space. In this technique, memory 
addresses are decoded within the peripheral devices and are 
thus used to determine when a specific device is being ad- 
dressed. Usually, each type of function within the peripheral 
device is assigned a memory address and can then be accessed 
by the CPU. For example, the peripheral device may contain a 
command register, a status register, a data in register and a data 
out register. Thus, four memory addresses might be utilized in 
performing I/O to this peripheral. Figure 1 is also the block dia- 
gram for a Memory Mapped I/O scheme. 


The chief advantage of Memory Mapped |/O is that all of the 
memory reference instructions are usually available to perform 
the I/O function. Consequently, no special I/O instructions are 
required in the machine. The key disadvantage of this technique 
is that a block of the memory addressing range must be set aside 
for assignment to the peripheral devices. Thus, the overall mem- 
ory addressing range of the machine is reduced by the size of this 
block. Again, the resources of the CPU are tied up while the I/O is 
being performed. A well known machine using only Memory 
Mapped I/O is the PDP 11. In it the upper 4k of memory space Is 
usually used for the I/O devices. 


Interrupt Driven I/O 


Interrupts are means by which a peripheral device can stop the 
normal flow of the CPU instruction execution and force the CPU to 
temporarily suspend its current program. Then, the program 
“jumps” to a different program which executes an 1/O transfer. 
Typically, this eliminates the need for polling the peripheral de- 
vices to determine if an I/O transfer is ready. Thus, the interrupt 
driven scheme provides a more efficient \/O transfer technique. 
However, there is an overhead burden associated with interrupts 
in that the CPU must store away and later restore all of the 
parameters required to resume the interrupted program. This 
overhead degrades the CPU performance. Depending on the 
overall interrupt structure, the CPU still may have to do some 
polling of devices which may be tied to the same interrupt level. 


It should be pointed out that both Programmed I/O and Memory 
Mapped can take advantage of the interrupt technique. Thatis, an 
interrupt can be used to initiate the peripheral data transfer in 
either type of system. The CPU still must control the transfer of 
the data between the memory and the peripheral device and the 
CPU resources are unavailable for executing other instructions 
during this time. 


What is DMA? 


DMA is a technique for data transfer which provides a direct path 
between the I/O device and the memory without CPU interven- 
tion. With this path, a peripheral device has “Direct Memory 
Access” and can transfer data directly to or from the memory. The 
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Figure 3. DMA I/O System. 


purpose of the DMA is to relieve the CPU of the task of controlling 
the I/O transfer, thereby freeing it to perform other tasks during 
this time, and to provide a means by which data can be transfer- 
red between an I/O device and memory at very high speed. 
Figure 3 shows the Block Diagram of a system where several I/O 
devices can perform DMA transfers into memory. Note that the 
CPU and peripheral devices share a common bus to the memory 
and that the CPU and peripheral devices cannot access memory 
during the same cycle. DMA can also be designed to perform 
memory-to-memory transfers or |/O-to-I/O transfers. 


Several DMA transfer methods exist, such as the CPU halt 
method, the memory timeslice method, and the “cycle steal” 
method. In the CPU halt method, the CPU is halted and switched 
off the bus while a DMA transfer occurs. This is the most 
straightforward method. However, it takes a relatively long time to 
switch the CPU on and off the bus, and the CPU cannot do 
anything during the transfer. 


The memory timeslice method works by splitting each memory 
cycle into two timeslots; one is reserved for the CPU and the other 
for DMA. This method provides the highest CPU execution rate 
as well as the highest DMA transfer rate because both the CPU 
and DMA are guaranteed access to memory during every mem- 
ory cycle. The disadvantage of this method is that high speed, 
costly memories must be used. 


The “cycle steal” method is a cost/performance compromise 
between the low cost of the CPU halt method and the high 
performance of the memory timeslice method. Cycle stealing 
refers to a DMA device “stealing” a CPU memory cycle in order to 
execute a DMA transfer. CPU program execution continues dur- 
ing the DMA transfer (the CPU is not halted), resulting in an 
overlap of CPU program execution with DMA transfer. If the CPU 
and a DMA device require a memory cycle at the same time, 
priority is granted to the DMA device and the CPU waits until the 
DMA cycle is completed. DMA causes CPU performance degra- 
dation only in those applications where the CPU uses the entire 
memory bandwidth. In many applications the CPU is slow relative 
to memory cycle time and “cycle stealing” provides satisfactory 
performance at relatively low cost. 


How is DMA Implemented? 


In order to relieve the CPU of the I/O transfer control task, circuitry 
external to the CPU must be added. This circuitry is called the 
DMA Controller and performs the following functions. 


Address Line Control — Ina DMAsystem, the memory address 
lines are driven by either the CPU or a DMA device, depending on 
which is using the memory during a given cycle. The DMA con- 
troller must switch the appropriate address onto the memory 
address lines. 


Data Transfer Control — The DMA Controller must provide the 
control signals required to transfer data directly between memory 
and an I/O device. As with the address lines, these control signals 
must be switched onto and off of the memory control lines appro- 
priately. 


Address Maintenance — Just as the CPU has the program 
counter and one or more other registers for memory address 
pointers, the DMA controller must also maintain an address 
pointer that indicates where the next word of data will be read or 
written in memory. This pointer must be incremented or dec- 
remented after each word transfer. 


Word Count Maintenance — At the initialization of a DMA 
transfer, the CPU specifies to the DMA Controller the total 
number of words to be transferred. During the transfer, the DMA 
controller must maintain a count of the number of words that have 
been transferred and terminate the transfer when the specified 
number of words has been reached. 


Mode Control — Certain aspects of a DMA transfer, such as 
direction of data flow, method of termination, etc., may vary from 
one DMA transfer to the next. For this reason, a number of DMA 
modes may be required. Mode control logic contained in the DMA 
controller, is set by the CPU atthe initialization of a DMA transfer. 


A DMA Controller can be placed in each I/O device (Distributed 
DMA) or DMA control circuitry for a number of I/O devices can be 
placed in a separate unit (Centralized DMA). The former provides 
the advantage of incremental cost; DMA control circuitry is added 
only as I/O devices are added. The latter provides the advantages 
of consolidation. 


At DMA initialization, the CPU normally specifies the mode, the 
starting memory address and the number of words to be trans- 
ferred (word count) to the DMA controller. In some applications, it 
is desirable to repeat a DMA transfer over and over again without 
disturbing the CPU. This capability is called Repetitive DMA, and 
can be implemented by adding two registers to the DMA con- 
troller. One register saves the starting address and the other the 
starting word count. This allows the DMA Controller to automati- 
cally reinitialize itself after the transfer of the data has been 
completed, thereby eliminating the need for CPU intervention. 


The Am2940 DMA ADDRESS GENERATOR 


The design of the Address Line Control, Data Transfer Control 
and Mode Control circuitry of a DMA Controller is dependent 
upon system architecture and timing; therefore, it varies con- 
siderably from system to system. However, the address mainte- 
nance and word count maintenance circuitry is independent of 
these variables, and is common to almost all DMA Controllers. 
The Am2940 DMA Address Generator is designed for use in DMA 
Controllers and provides the Address and Word Count mainte- 
nance circuitry that is common to most. It combines the advan- 
tages of high speed bipolar LSI with the flexibility and general 
purpose usefulness of microprogrammed control. 


Am2940 GENERAL DESCRIPTION 


The Am2940, a 28-pin member of Advanced Micro Devices 
Am2900 family of Low-Power Schottky bipolar LSI chips, is a 
high-speed, cascadable, eight-bit wide Direct Memory Access 
Address Generator slice. Any number of Am2940s can be cas- 
caded to form larger addresses. 


The primary function of the device is to generate sequential 
memory addresses for use in the sequential transfer of data to or 
from a memory. It also maintains a data word count and gener- 
ates a DONE signal when a programmable terminal count has 
been reached. The device is designed for use in peripheral con- 
trollers with DMA capability or in any other system which transfers 
data to or from sequential locations of a memory. 


The Am2940 can be programmed to increment or decrement the 
memory address in any of four control modes, and executes eight 
different instructions. The initial address and word count are 
saved internally by the Am2940 so that they can be restored later 
in order to repeat the data transfer operation. 


ADDRESS REGISTER 


MULTIPLEXER 
7 


INSTRUCTION 
(Ig—!p) 


INSTRUCTION 


DECODER 


Am2940 ARCHITECTURE 


As shown in the Block Diagram of Figure 4, the Am2940 consists 

of the following: 

e A three-bit Control Register. 

@ An eight-bit Address Counter with input multiplexer. 

@ An eight-bit Address Register. 

@ An eight-bit Word Counter with input multiplexer. 

@ An eight-bit Word Count Register. 

@ Transfer complete circuitry. 

@ An eight-bit wide data multiplexer with three-state output buf- 
fers. 

e Three-state address output buffers with external output enable 
control. 

@ An instruction decoder. 


Control Register 


Under instruction control, the Control Register can be loaded or 
read from the bidirectional DATA lines Do-D7, Control Register 
bits O and 1 determine the Am2940 Control Mode, and bit 2 
determines whether the Address Counter increments or dec- 
rements. Figure 5 defines the Control Register format. 


Address Counter 


The Address Counter, which provides the current memory ad- 
dress, is an eight-bit, binary, up/down counter with full look-ahead 
carry generation. The Address Carry Input (ACI) and Address 
Carry Output (ACO) allow cascading to accommodate larger 
addresses. Under instruction control, the Address Counter can 
be enabled, disabled, and loaded from the DATA inputs, Dp-D7, 
or the Address Register. When enabled and the ACI input is 
LOW, the Address Counter increments/decrements on the LOW 
to HIGH transition of the CLOCK input, CP. The Address Counter 
output can be enabied onto the three-state ADDRESS outputs 
Ao-A7 under control of the Output Enable input, OE,. 


WORD COUNT 
REGISTER 
MULTIPLEXER 
0 


MPR-226 





Figure 4. Am2940 DMA Address Generator. 
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Control Register 


Control 
Mode Control Word 
=<S Mode Type pangs 
HIGH when HIGH when 


Word | Word Count Equals Zero | | Word Count Equals Zero | Zero 


oe ee Address Compare | Hold 
Hebets Word Gounter Cary Out 


Done Output Signai 
WCI = LOW WCI = HIGH 


HIGH when 
Word Counter + 1 = 
Word Count Register 


HIGH when Word Counter = Address Counter 
Always LOW 


HIGH when 
Word Counter = 
Word Count Register 





Address Counter 
L Increment 
H Decrement 


Figure 5. Control Register Format Definition. 





Address Register 


The eight-bit Address Register saves the initial address so that it 
can be restored later in order to repeat a transfer operation. When 
the LOAD ADDRESS instruction is executed, the Address Reg- 
ister and Address Counter are simultaneously loaded from the 
DATA inputs, Do-D7. 


Word Counter and Word Count Register 


The Word Counter and Word Count Register, which maintain and 
save a word count, are similar in structure and operation to the 
Address Counter and Address Register, with the exception that 
the Word Counter increments in Control Modes 1 and 3, decre- 
ments in Control Mode 0, and is disabled in Control Mode 2. The 
LOAD WORD COUNT instruction simultaneously loads the Word 
Counter and Word Count Register. 


Transfer Complete Circuitry 


The Transfer Complete Circuitry is a combinational logic network 
which detects the completion of the data transfer operation in 
three Control Modes and generates the DONE output signal. The 
DONE signal is an open-collector output, which can be dot-anded 
between chips. 


Data Multiplexer 


The Data Multiplexer is an eight-bit wide, 3-input multiplexer 
which allows the Address Counter, Word Counter, and Control 
Register to be read at the DATA lines, Dp-D7. The Data Multi- 
plexer and three-state Data output buffers are instruction con- 
trolled. 


Address Output Buffers 


The three-state Address Output Buffers allow the Address 
Counter output to be enabled onto the ADDRESS lines, Ap-A7, 
under external control. When the Output Enable input, OEg, is 
LOW, the Address output buffers are enabled; when OE, is 
HIGH, the ADDRESS lines are in the high-impedance state. The 
address and Data Output Buffers can sink 24mA output current 
over the commercial operating range. 


Instruction Decoder 


The Instruction Decoder generates required internal control sig- 
nals as a function of the INSTRUCTION inputs, Ip-!2 and Control 
Register bits 0 and 1. 


Clock 


The CLOCK input, CP, is used to clock the Address Register, 
Address Counter, Word Count Register, Word Counter, and 
Control Register, all on the LOW to HIGH transition of the CP 
signal. 


Am2940 CONTROL MODES 


Control Mode 0 — Word Count Equals Zero Mode 


In this mode, the LOAD WORD COUNT instruction loads the 
word count into the Word Count Register and Word Counter. 
When the Word Counter is enabled and the Word Counter 
Carry-in, WCI, is LOW, the Word Counter decrements on the 
LOW to HIGH transition of the CLOCK input, CP. Figure 5 
specifies when the DONE signal is generated in this mode. 


Control Mode 1 — Word Count Compare Mode. 


In this mode the LOAD WORD COUNT instruction loads the word 
count into the Word Count Register and clears the Word Counter. 
When the Word Counter is enabled and the Word Counter 
Carry-in, WCi, is LOW, the Word Counter increments on the LOW 
to HIGH transition of the clock input, CP. Figure 5 specifies when 
the DONE signal is generated. 


Control Mode 2 — Address Compare Mode 


In this mode, only an initial and final memory address need be 
specified. The initial Memory Address is loaded into the Address 
Register and Address Counter and the final memory address is 
loaded into the Word Count Register and Word Counter. The 
Word Counter is always disabled in this mode and serves as a 
holding register for the final memory address. When the Address 
Counter is enabled and the ACI input is LOW, the Address 
Counter increments or decrements (depending on Control Reg- 
ister bit 2) on the LOW to HIGH transition of the CLOCK input, CP. 
The Transfer Complete Circuitry compares the Address Counter 
with the Word Counter and generates the DONE signal during the 
last word transfer, i.e., when the Address Counter equals the 
Word Counter. 





Control Mode 3 — Word Counter Carry Out Mode 


For this mode of operation, the user can load the Word Count 
Register and Word Counter with the two’s complement of the 
number of data words to be transferred. When the Word Counter 
is enabled and the WCI input is LOW, the Word Counter incre- 
ments on the LOW to HIGH transition of the CLOCK input, CP. A 
Word Counter Carry Out signal, WCO, indicates the last data 
word is being transferred. The DONE signal is not required in this 
mode and, therefore, is always LOW. 


Am2940 INSTRUCTIONS 


The Am2940 instruction set consists of eight instructions. Six 
instructions load and read the Address Counter, Word Counter 
and Control Register, one instruction enables the Address and 
Word Counters, and one instruction reinitializes the Address and 
Word Counters. The function of the REINITIALIZE COUNTERS, 
LOAD WORD COUNT, and ENABLE COUNTERS instructions 
vary with the Control Mode being utilized. Table 1 defines the 
Am2940 Instructions as a function of Instruction inputs Ig-l2 and 
the four Am2940 Control! Modes. 


The WRITE CONTROL REGISTER instruction writes DATA 
input Dp-D2 into the Control Register; DATA inputs D3-D7 are 
“don't care” inputs for this instruction. The READ CONTROL 
REGISTER instruction gates the Control Register outputs to 
DATA lines, Do-Dz. DATA lines D3-D7 are in the HIGH state 
during this instruction. 


The Word Counter can be read using the READ WORD 
COUNTER instruction, which gates the Word Counter ouputs to 
DATA lines Do-D7. The LOAD WORD COUNT instruction is 
Control Mode dependent. In Control Modes 0, 2, and 3, DATA 
inputs Dg-D7 are written into both the Word Count Register and 
Word Counter. In Control Mode 1, DATA inputs Dp-D7 are written 
into the Word Count Register and the Word Counter is cleared. 


The READ ADDRESS COUNTER instruction gates the Address 
Counter outputs to DATA lines Dg-D7, and the LOAD ADDRESS 
instruction writes DATA inputs Dg-D7 into both the Address Reg- 
ister and Address Counter. 





in Control Modes 0, 1, and 3, the ENABLE COUNTERS instruc- 
tion enables both the Address and Word Counters; in Control 
Mode 2, the Address Counter is enabled and the Word Counter 


holds its contents. When enabled and the carry input is active, the 


counters increment on the LOW to HIGH transition of the CLOCK 
input, CP. Thus, with this instruction applied, counting can be 
controlled by the carry inputs. 


The REINITIALIZE COUNTERS instruction also is Control Mode 
dependent. In Control Modes 0, 2, and 3, the contents of the 
Address Register and Word Count Register are transferred to the 
respective Address Counter and Word Counter; in Control Mode 
1, the content of the Address Register is transferred to the Ad- 
dress Counter and the Word Counter is cleared. The 
REINITIALIZE COUNTERS instruction allows a data transfer 
operation to be repeated without reloading the address and word 
count from the DATA lines. 


Am2940 Timing 


Various computations must be performed by the designer to 
determine how fast the Am2940 can be operated reliably in a 
given design. The exercises of this section demonstrate how 
these computations are performed. 


Worst case A.C. characteristics, over the full temperature and 
voltage operating range should be used in these computations. 
Since, at the time of this writing, the Am2940 is still being charac- 
terized, only typical A.C. characteristics are available. These 
typicals are used here merely to demonstrate how the computa- 
tions are performed; the designer must use worst-case charac- 
teristics. Figure 6 shows the characteristics of a Schottky register 
and a memory which are assumed for this exercise. 


Figures 7A, B, and C show the typical cycle time calculations for 
the 16-bit Am2940 configuration. The typical delay along the 
longest path for any of the eight Am2940 instructions determines 
the typical cycle time. In each case, delays are computed from the 
LOW to HIGH transition of a clock through an entire microcycle to 
the next LOW to HIGH transition of a clock. The typical cycle time 
for a 16-bit Am2940 configuration is 64ns. 


TABLE |. Am2940 INSTRUCTIONS 







READ 


READ 
WORD 


READ 
ADDRESS | RDAC | 0,1,2,3 
COUNTER 
REINITIALIZE 


LOAD 
WORD 


0, 2,3 
LDWC 
COUNT 


0,1,3 
ENABLE ENCT 
COUNTERS 


WCR = Word Count Reg 
WC = Word Counter 
D = Data 


CR = Control Reg. 
AR = Address Reg. 
AC = Address Counter 


Octal Control Word Word Address Address Control eat 
Code Mode Reg. Counter Reg. Counter Register DoD, 
ONTROL, WRCR 0,1,2,3 | HOLD HOLD HOLD HOLD Do-Do> CR INPUT 
REGISTER 
CONTROL RDCR 0,1,2,3 | HOLD HOLD HOLD HOLD HOLD CR>Do-Do 
REGISTER (Note 1 
RDWC 0,1,2,3 | HOLD HOLD HOLD 
COUNTER 


FOLD |_WCR->WC AR>KG [HOLD 
ZERO>WC | HOLD AR-AC | HOLD 


LOAD 
appress | LOAD | 0.1.2.3 | HOLD | HOLD D>AR 


HOLD HOLD HOLD INPUT 


ENABLE 
HOLD COUNT HOLD COUNT HOLD 
HOLD HOLD HOLD ENABLE HOLD 
COUNT 
L = LOW 


H = HIGH 
Z = High Impedance 








HOLD HOLD WC ->D 





HOLD HOLD AC>D 











DAC HOLD INPUT 






Note 1° 
Data Bits D3-D7 are high during this instruction. 
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Schottky Register 
Clock to Output Delay 


Input Set-Up Time 


Memory 
Address Set-Up Time 





Figure 8 shows the address output enable time computations. 
Since the Am2940 has an asynchronous address output enable 
control, the address output enable time may not be related to the 
Am2940 cycle time. 


Figure 9 shows the typical cycle time calculation for an 8-bit 
Am2940 configuration. The path shown ts the longest path and 
determines an 8-bit typical cycle time of 52ns. 


The typical cycle time calculation for a 24-bit Am2940 configura- 
tion is shown in Figure 10. The path shown is the longest path and 
determines a 24-bit typical cycle time of 76ns. 


Figure 111s asummary of typical Am2940 cycle times for the 8, 16 


Figure 6. Assumed AC Characteristics. and 24-bit configurations. 
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READ ADDRESS COUNTER, 
READ WORD COUNTER 
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Figure 7. 16-Bit Typical Cycle Time Computations. 
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CALCULATIONS FOR 
WRITE CONTROL REG, 
LOAD WORD COUNT, 
LOAD ADDRESS 
INSTRUCTIONS 






~~ ~DATAIN DATA OUT 


CLK 


STATUS 
REG 








ADDRESS 


MEMORY 





MPR-553 






DEVICE TYPE DEVICE PATH 


PATH 1 | PATH2 | PATH3 
Schottky Reg CLK to Q | 
2940 Inst Set-Up 
2 














oe 
200 | ar 
2040 | ciktowco | | 
| 2040 || weltopone | | 
cael 
pda 


pP AT H 1 SOR VASIS Dae heed meas oS Ey 
Pp {\ TH 2 Teo Fea BY ka cabaret SF ead A 
PAT H 3 eS Cc es oo ere a 


ae 
aan ao 
ace een 
Feuxte woo _| 
iaianl iy 
aa 


Totai-ns | 


4 





c) 


CALCULATIONS FOR 

REINITIALIZE COUNTERS, 
ENABLE COUNTERS 
INSTRUCTIONS ; * = 
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Figure 7. 16-Bit Typical Cycle Time Computations. (Cont.) 
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CALCULATIONS FOR 
ENABLE MEMORY ADDRESS 
(ASYNCHRONOUS) 
INSTRUCTIONS 


DATA IN DATA OUT 
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Figure 8. Speed Computations. 
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Figure 9. 8-Bit Typical Cycle Time Computation. 
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Figure 10. 24-Bit Typical Cycle Time Computation. 
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Figure 11. Summary of Am2940 Cycle Times. 


AN EXAMPLE DESIGN 


The Am2940 is designed for use in high speed peripheral Con- 
trollers using DMA and provides the address and word count 
maintenance circuitry that is common to most. As indicated previ- 
ously, DMA Control can be placed in each I/O Controller (Distri- 
buted DMA) or DMA Control for a number of I/O devices can be 
centralized in a separate unit. 


Figure 12 shows a block diagram of a microprogrammed |/O 
Controller which is designed for use in a Distributed DMA system. 
The Am2910 Microprogram Sequencer, Microprogram Memory 
and the Microinstruction Register form the microprogram control 
portion of this I/O Controller. The Am2940 maintains the memory 
address and word count required for DMA operation. An internal 
three-state bus provides the communication path between the 
Microinstruction Register, the Am2917 Data Transceivers, the 
Am2940, the Am2901A Microprocessor, and the Device Interface 


Circuitry. The Address Line Control, Data Transfer Control and 
Mode Control functions of this DMA Controller are incorporated 
into the I/O Controller Microprogram and the Asynchronous 
Interface Control Circuitry. The I/O Controller Microprogram also 
controls the Am2940. 


The Am2940 interconnections are shown in detail in Figure 13. 
Two Am2940s are cascaded to generate a sixteen-bit address. 
The Am2940 ADDRESS and DATA output current sink capability 
is 24mA over the commercial operating range. This allows the 
Am2940s to drive the System Address Bus and Internal Three- 
State Bus directly, thereby eliminating the need for separate bus 
drivers. Three bits in the Microinstruction Register provide the 
Am2940 Instruction Inputs, Ip-l2. The microprogram clock is used 
to clock the Am2940s and, when the ENABLE COUNTERS in- 
struction is applied, address and word counting is controlled by 
the CNT bit of the Microinstruction Register. 


Asynchronous interface control circuitry generates System Bus 
control signals and enables the Am2940 Address onto the Sys- 
tem Address Bus at the appropriate time. The open-collector 
DONE outputs are dot-anded and used as a test input to the 
Am2910 Microprogram Sequencer. 


The I/O controller read operation is flowcharted in Figure 14. The 
CPU initializes the I/O controller by sending a read command, the 
starting memory address, the word count and any other parame- 
ters required to perform the operation. The 1/O Controller then 
obtains a word of data from the I/O device and requests use of the 
system bus for a DMA transfer. When the bus is granted, the I/O 
Controller requests a memory data transfer. Upon receipt of the 
memory acknowledge signal, which indicates the memory trans- 
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Figure 12. DMA Peripheral Controller Block Diagram. 
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Figure 13. Am2940 Interconnections. 
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Figure 14. Read Control Flowchart. 


fer is complete, the I/O Controller tests the word count. If the word 
count is not equal to zero, the word counter is decremented, the 
address counter is incremented and another data word ts trans- 
ferred. When the word count reaches zero, the I/O Controller 
terminates the data transfer and informs the CPU thatthe transfer 
has been completed. 


THE Am2942 PROGRAMMABLE TIMER/COUNTER, 
DMA ADDRESS GENERATOR. 


GENERAL DESCRIPTION 


The Am2942, a 22-pin version of the Am2940, can be used as 
a high-speed DMA address Generator or Programmable 
Timer/Counter. It provides multiplexed Address and Data lines, 
for use with a common bus, and additional Instruction Input and 
Instruction Enable pins. The Am2942 executes 16 instructions; 
eight are the same as the Am2940 instructions, and eight in- 
structions facilitate the use of the Am2942 as a Programmable 
Timer/Counter. The Instruction Enable input allows the sharing 
of the Am2942 instruction field with other devices. 


When used as a Timer/Counter, the Am2942 provides two in- 
dependent, programmable, eight-bit, up-down counters in a 
22-pin package. The two on-chip counters can be cascaded to 
form a single chip, 16-bit counter. Also, any number of chips 
can be cascaded — for example three cascaded Am2942s form 
a 48-bit timer/counter. 


Reinitialization instructions provide the capability to reinitialize 
the counters from on-chip registers. Am2942 Programmable 
Control Modes, identical to those of the Am2940, offer four dif- 
ferent types of programmable control. 


Am2942 ARCHITECTURE 

As shown in the Block Diagram, the Am2942 consists of the 
following: 

A three-bit Control Register. 

An eight-bit Address Counter with input multiplexer. 

An eight-bit Address Register. 

An eight-bit Word Counter with input multiplexer. 

An eight-bit Word Count Register. 

Transfer complete circuitry. 

An eight-bit wide data multiplexer with three-state output 
buffers. 

An instruction decoder. 
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Figure 15. Am2942 Block Diagram. 


Control Register 


Under instruction control, the Control Register can be loaded or 
read from the bidirectional DATA lines, Do-D7. Control Register 
bits O and 1 determine the Am2942 Control Mode, and bit 2 
determines whether the Address Counter increments or dec- 
rements. Figure 16 defines the Control Register format. 


Address Counter 


The Address Counter, which provides the current memory ad- 
dress, is an eight-bit, binary, up/down counter with full look-ahead 
carry generation. The Address Carry input (ACI) and Address 
Carry Output (ACO) allow cascading to accommodate larger 


addresses. Under instruction control, the Address Counter can 
be enabled, disabled, and loaded from the DATA inputs, Do-D7, 
or the Address Register. When enabled and the ACI input is 
LOW, the Address Counter increments/decrements on the LOW 
to HIGH transition of the CLOCK input, CP. 


Address Register 


The eight-bit Address Register saves the initial address so that it 
can be restored later in order to repeat a transfer operation. When 
the LOAD ADDRESS instruction is executed, the Address Reg- 
ister and Address Counter are simultaneously loaded from the 
DATA inputs, Do-D7. 


Control Register 


Control Done Output Signal 
Mode Control Word 
Number Mode Type peg Wc! = LOW | Wor-HiGH | 
HIGH when HIGH when 
none Word Count Equals Zero | Decrement Word'Counter=-4 Word Counter = 0 


HIGH when 
Word Count Compare Word Counter + 1 = 
Word Count Register 
PSH teens ee Address Compare HIGH when Word Counter = Address Counter 


HIGH when 
Word Counter = 
Word Count Register 


| CR | Address Counter 
L Increment 
H Decrement 


Figure 16. Control Register Format Definition. 








Word Counter And Word Count Register 


The Word Counter and Word Count Register, which maintain and 
save a word count, are similar in structure and operation to the 
Address Counter and Address Register, with the exception that 
the Word Counter increments in Control Modes 1 and 3 and 
decrements in Control Modes 0 and 2. The LOAD WORD 
COUNT instruction simultaneously loads the Word Counter and 
Word Count Register. 


Transfer Complete Circuitry 


The Transfer Complete Circuitry is a combinational logic network 
which detects the completion of the data transfer operation in 
three Control Modes and generates the DONE output signal. The 
DONE signal is an open-collector output, which can be dot-anded 
between chips. 


Data Multiplexer 


The Data Multiplexer is an eight-bit wide, three-input multiplexer 
which allows the Address Counter, Word Counter and Control 
Register to be read at DATA lines Dp-D7. The Data Multiplexer 
output, Yo-Y7, is enabled onto DATA lines Dy-D;7 if, and only if, the 
Output Enable input, OEp, is LOW. (Refer to Figure 17.) 


OE | 


L 
H 


Figure 17. Data Bus Output Enable Function. 
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Instruction Decoder 


The Instruction Decoder generates required internal control sig- 
nals as a function of the INSTRUCTION inputs, I9-l3 Control 
Register bits 0 and 1, and the INSTRUCTION ENABLE input, Ie. 


Clock 


The clock input, CP, is used to clock the Address Register, 
Address Counter, Word Count Register, Word Counter, and 
Control Register, all on the LOW to HIGH transition of the CP 
signal. 


Am2942 CONTROL MODES 


Control Mode 0 — Word Count Equals Zero Mode 


In this mode, the LOAD WORD COUNT instruction loads the 
word count into the Word Count Register and Word Counter. 
When the Word Counter is enabled and the Word Counter 
Carry-in, WCl, is LOW, the Word Counter decrements on the 
LOW to HIGH transition of the CLOCK input, CP. Figure 16 
specifies when the DONE signal is generated in this mode. 


Control Mode 1 — Word Count Compare Mode 


In this mode the LOAD WORD COUNT instruction loads the word 
count into the Word Count Register and clears the Word Counter. 
When the Word Counter is enabled and the Word Counter 
Carry-in, WCIl, is LOW, the Word Counter increments on the LOW 
to HIGH transition of the clock input, CP. Figure 16 specifies 
when the DONE signal is generated. 





Control Mode 2 — Address Compare Mode 


In this mode, only an initial and final memory address need to be 
specified. The initial Memory Address is loaded into the Address 
Register and Address Counter and the final memory address is 
loaded into the Word Count Register and Word Counter. The 
Word Counter serves as a holding register for the final memory 
address. When the Address Counter is enabled and the ACI input 
is LOW, the Address Counter increments or decrements (de- 
pending on Control Register bit 2) on the LOW to HIGH transition 
of the CLOCK input, CP. The Transfer Complete Circuitry com- 
pares the Address Counter with the Word Counter and generates 
the DONE signal during the last word transfer, i.e., when the 
Address Counter equals the Word Counter. 


Control Mode 3 — Word Counter Carry Out Mode 


For this mode of operation, the user can load the Word Count 
Register and Word Counter with the two’s complement of the 
number of data words to be transferred. When the Word Counter 
is enabled and the WCi input is LOW, the Word Counter incre- 
ments on the LOW to HIGH transition of the CLOCK input, CP. A 
Word Counter Carry Out signal, WCO, indicates the last data 
word is being transferred. The DONE signal ts not required in this 
mode and, therefore, is always LOW. 





Am2942 INSTRUCTIONS 


The Am2942 instruction set consists of sixteen instructions. 
Eight are DMA instructions and are the same as the Am2940 
instructions. The remaining eight instructions are designed to 
facilitate the use of the Am2942 as a Programmable Timer/ 
Counter. Figures 18 and 19 define the Am2942 Instructions. 


Instructions 0-7 are DMA instructions. The WRITE CONTROL 
REGISTER instruction writes DATA input Do-Dz into the Control 
Register; DATA inputs D3-D7 are “don’t care” inputs for this 
instruction. The READ CONTROL REGISTER instruction gates 
the Control Register to Data Multiplexer outputs Yo-Y5. Outputs 
Y3-Y7 are HIGH during this instruction. 


The Word Counter can be read using the READ WORD 
COUNTER instruction, which gates the Word Counter to Data 
Multiplexer outputs, Yo-Y7. The LOAD WORD COUNT instruc- 
tion is Control Mode dependent. In Control Modes 0, 2 and 3, 
DATA inputs Do-D7 are written into both the Word Count Register 
and Word Counter. In Control Mode 1, DATA inputs Dg-D7 are 
written into the Word Count Register and the Word Counter is 
cleared. 


The READ ADDRESS COUNTER instruction gates the Address 
Counter to Data Multiplexer outputs, Yo-Y7, and the LOAD AD- 
DRESS instruction writes DATA inputs Dg-D7 into both the Ad- 
dress Register and Address Counter. 


In Control Modes 0, 1, and 3, the ENABLE COUNTERS instruc- 
tion enables both the Address and Word Counters; in Control 
Mode 2, the Address Counter is enabled and the Word Counter 
holds its contents. When enabled and the carry input is active, the 
counters increment on the LOW to HIGH transition of the CLOCK 
input, CP. Thus, with this instruction applied, counting can be 
controlled by the carry inputs. 


The REINITIALIZE COUNTERS instruction also is Control Mode 
dependent. In Control Modes 0, 2, and 3, the contents of the 
Address Register and Word Count Register are transferred to the 
respective Address Counter and Word Counter; in Control Mode 
1, the content of the Address Register is transferred to the Ad- 
dress Counter and the Word Counter is cleared. The 
REINITIALIZE COUNTERS instruction allows a data transfer 
operation to be repeated without reloading the address and word 
count from the DATA lines. 
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Notes. 1 When lz ts tiled LOW, the Am2942 acts as a DMA circuit. When lg 1s tied HIGH, the Am2942 acts 


as a Timer/Counter circuit 


2. Am2942 instructions 0 through 7 are the same as Am2940 instructions. 





Figure 18. Am2942 Instructions 


When Ie is HIGH, Instruction inputs, Ip-l2, are disabled. If I is 
LOW, the function performed is identical to that of the 
ENABLE COUNTERS instruction. Thus, counting can be con- 
trolled by the carry inputs with the ENABLE COUNTERS instruc- 
tion applied or with Instruction Inputs Ip-l2 disabled. 


Instructions 8-F facilitate the use of the Am2942 as a Pro- 
grammable Timer/Counter. They differ from instructions 0-7 in 
that they provide independent control of the Address Counter, 
Word Counter and Control Register. 


The WRITE CONTROL REGISTER, T/C instruction writes DATA 
input Do-Dz into the Control Register. DATA inputs D3-D7 are 
“don't care” inputs for this instruction. The Address and Word 
Counters are enabled, and the Control Register contents appear 
at the Data Multiplexer output. 


The REINITIALIZE ADDRESS COUNTER instruction allows the 
independent reinitialization of the Address Counter. The Word 
Counter is enabled and the contents of the Address Counter 
appear at the Data Multiplexer output. 


The Word Counter can be read, using the READ WORD 
COUNTER, T/C instruction. Both counters are enabled when this 
instruction is executed. 


When the READ ADDRESS COUNTER, T/C instruction is exe- 
cuted, both counters are enabled and the address counter con- 
tents appear at the Data Multiplexer output. 


The REINITIALIZE ADDRESS and WORD COUNTERS instruc- 
tion provides the capability to reinitialize both counters at the 
same time. The Address Counter contents appear at the Data 
Multiplexer output. 


DATA inputs Do-D7 are loaded into both the Address Register 
and Counter when the LOAD ADDRESS, T/C instruction is exe- 
cuted. The Word Counter is enabled and its contents appear at 
the Data Multiplexer output. 


The LOAD WORD COUNT, T/C instruction is identical to the 
LOAD WORD COUNT instruction with the exception that Ad- 
dress Counter is enabled. 


The Word Counter can be independently reinitialized using the 
REINITIALIZE WORD COUNTER instruction. The Address 
Counter is enabled and the Word Counter contents appear at the 
Data Multiplexer output. 


When the I input is HIGH, Instruction inputs, Io-l2, are disabled. 
The function performed when |3 is HIGH is identical to that per- 
formed when |; is LOW, with the exception that the Word Counter 
contents appear at the Data Multiplexer output. 


EXAMPLE DESIGNS 


Figure 20 shows an Am2942 used as two independent, pro- 
grammable eight-bit timer/counters. In this example, an 
Am2910 Microprogram Sequencer provides an address to 
Am29775 512 x 8 Registered PROMs. The on-chip PROM out- 
put register is used as the Microinstruction Register. 


The Am2942 Instruction input, I3 is tied HIGH to select the eight 
Timer/Counter instructions. The Ie, Ig-l2, and OEp inputs are 
provided by the microinstruction, and the Dg-D7 data lines are 
connected to a common Data Bus. GATE WC and GATE AC are 
separate enable controls for the respective Word Counter and 
Address Counter. The DONE, ACO and WCO output signals 
indicate that a pre-programmed time or count has been reached. 
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Figure 19. Am2942 Function Table. 
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Figure 20. Two 8-Bit Programmable Counters/Timers in a 22-Pin Package. 


Figure 21 shows an Am2942 used as a single 16-bit, pro- Figure 22 shows two Am2942s cascaded to form a 32-bit pro- 
grammable timer/counter. In this example, the Word Counter grammable timer/counter. The two Word Counters form the low 
carry-out, WCO, is connected to the Address Counter carry-in, order 16 bits, and the two Address Counters form the high 
ACI, to form a single 16-bit counter which is enabled by the order bits. This allows the timer/counter to be loaded and read 
GATE signal. 16 bits at a time. 
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Figure 21. 16-Bit Programmable Counter/Timer Using a Single Am2942. 
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Figure 22. 32-Bit Programmable Counter/Timer Using Two Am2942s. 


In Figure 23, two Am2942s are shown cascaded to form dual 
16-bit counters/timers. GATE WC and GATE AC are separate 
enable controls for the respective Word Counter and Address 
Counter. Using the 16-bit Data Bus, each 16-bit counter can be 
loaded or read in parallel. 
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Figure 23. Dual 16-Bit Programmable Counters/Timers. 


Figure 24 shows two Am2942s used as DMA address Generators 
on acommon DATA/ADDRESS bus. The common bus allows the 
use of the Am2942 multiplexed data and address pins, Do-D7. The 
Am2942 is in a 22 pin package whereas the Am2940, which has 
separate address and data pins, requires a 28 pin package. 
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Figure 24. Am2942s Used as DMA Address Generator on Common Bus. 


In this example the Am2942 Address Counter, Word Counter 
and Control Register are loaded and read directly from the 
CPU via the DATA/ADDRESS bus. Since the bus carries ad- 
dresses as well as data, the Do-D7 pins can be used also to 
enable the address onto the bus. 


Four bits in the Microinstruction Register provide the Am2942 
Instruction Inputs, Io-lp and the Instruction Enable input l_. The 
l4 input is tied LOW, selecting the eight DMA instructions. The 


microprogram clock is used to clock the Am2942s, and when 
the ENABLE COUNTERS instruction is applied or the instruc- 


tion is disabled (le = HIGH), address and word counting is 
controlled by the CNT bit of the Microinstruction Register. 


Interface control circuitry generates bus control signals and en- 
ables the Am2942 address onto the bus at the appropriate. The 
open-collector DONE outputs are dot-anded and used as a test 
input to the microprogram sequencer. 
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INTRODUCTION 


Modern digital systems are becoming faster and increasingly 
complex. As a result, more is being demanded of digital design 
engineers. Fortunately, there is a design technique that can 
greatly simplify the design process. It can also lead to cleaner, 
more efficient, more reliable finished devices. This technique is 
called MICROPROGRAMMING. Do not be confused by this 
word; it has nothing whatever to do with machine level language 
Or programming a microprocessor. Microprogramming is inher- 
ently more powerful than programming in a processor's instruc- 
tion set for many reasons, not the least of which ts the access to 
the entire functional resources of the hardware on a machine 
cycle by machine cycle basis. An excellent treatment of micro- 
programming and microprogrammed machines is available from 
AMD in previous application notes. Perhaps the easiest to com- 
prehend introduction to this subject is in AMD’s Microprogram- 
ming Handbook. This is highly recommended reading for any 
newcomer to this area of digital design. 


Though microprogramming has always been an inherently more 
powerful design technique since its invention in 1955, ithas been 
little used until recently (1976), and with some justification. The 
reason is quite simple. The very large majority of IC’s available 
until the 1976-1978 time frame were specifically designed to be 
used with ‘random logic’ design techniques. Since these random 
logic IC’s were poorly suited to the highly structured nature of well 
designed microprogrammed systems, the potential advantages 
of microprogrammed systems could not be realized easily. 


Fortunately for all of us, in the mid 1970's AMD made a significant 
decision to develop a very extensive family of Schottky technol- 
ogy IC’s specifically optimized for use in microprogrammed sys- 
tems. These circuits belong to the Am2900 family as well as the 
Am25S, Am26S, Am27S, and Am25LS families. The acceptance 
has been so great that many of the other large IC manufacturers 
are now second sourcing many of these parts and introducing 
others. So, in just three to four years, microprogrammed machine 
design has come of age. Now, for most any job of medium to very 
high complexity, a microprogrammed system is the only way to go 
if a microprocessor isn't fast or versatile enough. 


The purpose of this application note is to illustrate the use of 
microprogramming and ‘bit-slice’ technology in a high perfor- 
mance 16-bit time-sharing CPU.This application note is unique in 
that the CPU being described is the heart of anew commercially 
available minicomputer system. Thus, it is possible to examine 
the nature of the CPU as it relates to a complete basic minicom- 
puter system. For this reason, a very short section follows that 
describes the basic system elements and the system goals to- 
ward which the CPU was designed. 


The product described herein is called the “HEX-29” CPU. Infor- 
mation on the AMD devices embodied in this application note 
should be directed to AMD via your local AMD representatives. 
Inquiries about the HEX-29 CPU and minicomputer system for 
OEM and/or end user applications should be directed to: 


HEX-29 
. Digital Microsystems, Inc. 
4448 Piedmont 
Oakland, CA 94661 
(415) 658-8532 


SYSTEM DESIGN GOALS 


In any significant project it is mandatory that reasonable, coher- 
ent system design goals be spelled out before serious work is 
begun. This can be a surprisingly short list of general specifica- 
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tions, but a well thought out system philosophy can make ail the 
difference. Most important, everyone involved should have a 
copy so everyone will be pulling in the same direction. 


The following list represents the system design goals for the 
HEX-29 CPU and system. 


Compact, reliable, easy to use. 

Multi-user, multi-task, timesharing. 

Fast, code-efficient high level language processing. 

Low cost for complete system. 

Intelligent microprogrammed channel controllers for high 
speed I/O 


a eS 


indeed, this seems like a short list, but it is the list from which the 
more detailed specifications were developed. For example, in 
order to be compact, switching power supply technology ts em- 
ployed. Reliability evolves from many factors including burn tn 
and testing cycles. Probably the single largest cause of ‘flakiness’ 
In digital systems ts insufficient cooling. An oversize fan moves 
about five times the volume of air past the IC’s as is normally 
recommended. This large, slower speed fan has the additional 
advantage that the lower frequency ‘white noise’ It generates is 
far less annoying than the ‘whine’ from smaller high speed fans. 


SO, it Is easy to see that many of the more specific details of 
system design will fall readily out of these overall design goals. 
The features of the final HEX-29 system are shown below. It 
should be instructive to trace each of these features to one (or 
more) of the design goals listed above. Reviewing this list will also 
prepare the context for the more detailed sections to follow in later 
sections. 


HEX-29 FEATURES 


VERY FAST 

—160ns basic machine cycle 

—Only two machine cycles for many instructions 
—Microprogrammed clock for increased through-put 


COMPLETE SET OF DATA TYPES 
—Bit operations 

—Nibble operations 

—Byte operations 

—Word operations 

~Double word operations 

—Quad word operations 

—Vartable field operations 


EXTENSIVE REGISTER SET 

—16 general purpose/defined purpose registers 
—16 memory management registers 
—Extended function condition code register 
—4 interrupt control/status registers 


MICROPROGRAMMED 

—Expandable instruction set (on board) 
—Writable/fixed control store capability 

—Integral fixed/floating point processor 

—Highly structured, comprehendible, modular design 


SOPHISTICATED MEMORY MANAGEMENT 
—Multi-user and multi-task timesharing structure 
—Complete intertask protection and security 

—Megabyte addressing space (expandable) 

— Software protectable pages for shared re-entrant coding 
—Dual mode operating capability 


MULTIPLE STACK PROCESSOR 

—Sophisticated program linkage through defined contro! stack 
pointer 

—Multiple, general register, data stack processing 
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SOPHISTICATED INTERRUPT STRUCTURE 

—8 level maskable vectored prioritized hardware interrupts 
—Second level prioritized expansion on each hardware level 
—256 levels of program controlled software interrupts 
—Invalid memory access trap is a vectored interrupt 
—Non-existent instruction trap is a vector interrupt 
~—Breakpoint instruction is a special vectored interrupt 

— Automatic mode switching on all interrupts 


HIGH THRU-PUT DMA/REFRESH CONTROL 

~8 level prioritized DMA requests and acknowledges 

— Up to four Mega-byte/second DMA transfer rate without slowing 
program execution 

—Up to 12 Mega-byte/second DMA transfer rate 

—Integral transparent dynamic memory refresh control 


EXTENSIVE HIGH LEVEL INSTRUCTION SET 

~Multitude of data types handled 

—Enormous variety of addressing modes 

—General register and defined register classes of instructions 

—Many very fast numeric and string macroinstructions 

—Integral 16 and 32-bit integer and 64-bit floating point ADD, 
SUB, MUL, DIV, CMP, NEG, etc 

—Advanced character, byte and word string processing 

—Microcoded high level language primitives 


VERY HIGH QUALITY PHYSICAL DESIGN 

—Four layer P.C. cards throughout system (internal GND and 
Vcc) 

—All bus signals interleaved with direct return ground path 

—All bus signals active low; three-state to inactive level 


INTELLIGENT CHANNEL CONTROLLERS 

~—Microprogrammed floppy disk and hard disk controllers 

~ Services multiple users I/O simultaneously and transparent to 
CPU program execution 

— Reduces executive program complexity and speeds execution 


SOFTWARE SUPPORT 

—Multi-user/multi-task time sharing operating system includes 
sophisticated file management features 

—Sophisticated resident macro-assembler 

—Customized micro-assembler 

— Superfast, super extended BASIC interpreter 

-True PASCAL compiler (not interpreter) 

—Advanced editor and word processor package 

—More software coming 


It should be clear from this list that the HEX-29 minicomputer Is a 
powerful/sophisticated design This is DIRECTLY attributable to 
the availability of the excellent Schottky technology I.C.’s availa- 
ble from AMD for use In microprogrammed digital systems. 


In a well designed microprogrammed system there should be 
VERY few random logic gate packages required. In the HEX-29 
CPU, there are only a few gates used as such. If anywhere near 
20% of a microprogrammed system is composed of gate pack- 
ages, It is probable that the design can be further simplified to 
replace the random logic with microcode and/or structured logic 
techniques It is important to note that the more functions that are 
implemented with structured logic and controlled by microcode 
bits, the more versatile and general is the whole design 





MICROPROGRAMMED MACHINES 


It is highly recommended that AMD’s MICROPROGRAMMING 
HANDBOOK be studied before this application note if a detailed 
understanding of the HEX-29 CPU is desired. The idea is, of 
course, that the basic principles of microprogrammed machines 
be familiar before this specific example is examined. The 
Am2900 Learning and Evaluation Kit is also recommended as a 
practical introduction tool. For those only interested in the capa- 
bilities of a well designed microprogrammed CPU, that reading is 
not entirely necessary, and Section V of this Application note will 
be superfluous. Section IV is a more general discussion for these 
readers, but is also necessary for those going on to Section V. 


A short discussion of microprogrammed systems appears here 
only as a short refresher for those who have studied the MICRO- 
PROGRAMMING HANDBOOK by John Mick and Jim Brick of 
AMD. 


Any microprogrammed machine can be divided into the following 
two discrete parts: 


1. Control store and microprogram control 
2. Data routing and function logic 


These two sections of a microprogrammed machine are really 
quite nearly independent. In effect, the control store and micro- 
program control section is the ‘boss and brains’ of the operation. It 
Issues all of the orders and makes all the decisions. The data 
routing and function logic devices are merely puppets that carry 
out the commands selected by the microprogram control logic 
from the control store. Note that ‘microword memory’ and ‘mi- 
crocode’ are used interchangeably with ‘control store’ and are 
synonomous. 


Control Store and Microprogram Control 


The control store is simply a number of PROM’s. The number of 
locations in this memory is chosen to be large enough to hold the 
desired number of microprogram routines. The width of the word 
is chosen to have sufficient bits to control all of the possible 
functions in the data routing and function logic. Admittedly, RAM 
or EPROM could be used as the memory devices, but it is best to 
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think of it as an array of read only memory devices. So, schemati- 
cally an example of a control store array looks like Figure 1. 


In practice, there is a register between the microword data bits 
and the actual data routing and function control devices. This 
register assures that all bits change simultaneously at the begin- 
ning of each new microinstruction cycle and allows the execution 
of one microinstruction with the fetching of the next. The addition 
of this ‘pipeline register’ is shown in the Figure 2 expansion of our 
block schematic 


The remaining part of this section is the microprogram control 
unit, more commonly called the microprogram sequencer. The 
microprogram sequencer is nothing more than a presettable bi- 


nary counter with a few extra functions. Figure 3 shows this 
device in place. 


We show the sequencer as a 12-bit binary counter with a few 
other inputs. The outputs (Y) drive the address lines of the control 
store PROMs. So, each time the system clock rises, the counters 
increment and sequential addresses are accessed from the 
PROM Note that the current output of the control store is cap- 
tured in the pipeline register on this same LOW-to-HIGH trans!- 
tion. Thus, the sequencer is always fetching the NEXT control 
store word which will control the fetching of the next, and so on 
and so on. 


Note that there are several bits from the pipeline register that are 
routed back to the sequencer. In our example, 12 bits are used as 
a microword branch address and another bit is used as a preset 
enable (load) line Normally, each cycle of the system clock 
increments the sequencer outputs and the next microword is 
fetched from the control store. However, somewhere down the 
line we are going to want to branch to a microcode sequence that 
is not ‘In line’ with the code that ts currently executing. It is very 
easy to see how this is done. 


The microaddress of the routine to which we want to branch is 
imbedded in the current microword, 12 bits in our example. The 
microword bit that is connected to the load input of the sequencer 
is coded to be low on this cycle. So, the sequencer, which ts really 
just a 12-bit counter with a unique load control in our example, 
will cause the branch address we selected to pass through to the 
output of the counters and fetch the microword from the micro- 
address to which we branched. The routine will now continue to 
execute sequentially addressed microwords until we execute 
another branch code 


The only other really necessary function we need from our se- 
quencer ts the ability to do conditional branches. In other words, 
we want to be able to branch to some microcode routine, but only 
if a certain condition exists. As usual, this capability is easily 
added; only one multiplexer is needed. Figure 4 shows the new 
configuration. 
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Now two additional microword bits control the conditions under 
which a microbranch will take place. If input O Is selected, a 
branch will always take place since the logic LOW level on input 0 
will appear at the load input of the sequencer. Conversely, if input 
3 1s selected, a HIGH logic level is always routed through the 
multiplexer to the load input and a load is not performed. Thus the 
next sequential microinstruction is fetched. So far we can do 
branch and continue functions with the multiplexer. 


If we select inputs 1 or 2 on the condition select multiplexer, we 
may get one of two conditions. If the selected input is HIGH, it will 
be routed to the load input of the sequencer and no load wiil take 
place. But if the selected condition is at a LOW logic level, the 
load input of the sequencer ts pulled LOW, a load is performed, 
and a branch has been accomplished. Since a branch only occurs 
when the condition bit is LOW, this function ts called a ‘branch on 
condition = 0’. Clearly a ‘branch on condition = 1’ can be im- 
plemented simply by inverting the condition bit before it enters the 
multiplexer. 


So as far as controlling the flow of microprograms goes, itis clear 
that we can make it look very much like assembly language 
programming of a microcomputer. We can execute sequential 
microinstructions (in line code), branch conditionally, or branch 
unconditionally. If we use real live sequencers like the Am2909, 
Am2910 or Am2911 instead of binary counters we get several 
other very important functions including micro-subroutining and 
looping 


When we substitute Am2909’s, Am2910’s or Am2911’s as our 
sequencers, the final element of our complete microprogram 
memory and control section is in place. Figure 5 shows this 
configuration. 


The next address PROM of Figure 5 converts the microcode 
branch function bits into one of two sets of bits that control the 
function performed by the Am2911’s. Which of the two is chosen 
depends upon the logic level of the particular condition bit that is 
selected. 


This ts the basic structure of any microprogram control unit re- 
gardless of what the rest of the system looks like The width of the 
microword data word, the microaddress field, the condition select 
field, etc , will change as needed, but the structure remains the 
same. Note that some of the microword data bits are used to 
control the microprogram sequencing logic. The bits left over are 
used to control the data routing and function logic in the device, 
I.e., everything else! 


Data Routing and Function Logic 


The data routing and function logic section of a micropro- 
grammed machine closely reflects the job the device ts to per- 
form. In this respect there is some similarity with random logic 
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designs. The key difference is the glue that binds all of the small 
functional units that make a device work. In arandom logic design 
it is a more or less random array of gates and flip-flops that 
interconnect and control these functional units. 


The chief advantage of a microprogrammed machine is that this 
random logic is largely replaced by the coherent sequences of 
control bits that is the microprogram. Problems such as race 
conditions, undesirable interactions between functional elements 
and marginal timing nearly disappear in a microprogrammed 
design. Often there are one or two internal data buses on which all 
transfers of internal data between functional units take place. 


Think of several possible sources of information that may be 
needed in a particular design. If they are all three-statable de- 
vices, microword bits could be tied to the output enable of each 
and the desired device enabled onto the internal bus on a micro- 
cycle by microcycle basis. Likewise one or more devices may 
capture this data. Microword bits attached to the clock pulse (CP) 
inputs of registers and the like can achieve this function. 


Further, microword bits select other functions to be performed, for 
example an ALU or shift function. Much of Section V of this 
application note will demonstrate the use of these data routing 
and function logic control bits. 





GENERAL SPECIFICATIONS 


The following section of this application note explores the design 
of the HEX-29 CPU on an intermediate level. It will be similar in 
detail to the detailed hardware and software specifications given 
for most microprocessors by the manufacturers. In other words, 
all the information needed to use the HEX-29 CPU, including bus 
timing and instruction set, are examined. This will serve to dem- 
onstrate what can be achieved in a medium level micropro- 
grammed machine. It will also serve as a necessary transition for 
those planning to study the more detailed internal structure of the 
CPU in the next section of this application note. 


It is very important, when designing a microprogrammed ma- 
chine, that the target device be specified in detail approaching 
that given in this section. Only then can an intelligent attempt at 
hardware design begin. It is especially important to define a 
clean, simple, reliable interface between the microprogrammed 
device and other system elements. Considerable attention 
should also be paid to defining data types, instruction formats, 
interrupt requirements, etc. 


internal CPU Registers 


The HEX-29 CPU has 36 internal registers. Of these, 16 are 
memory management (map) registers, 16 are general purpose 
registers, three are associated with the interrupt structure, and 
one Is the condition code register. 


Table 1 shows the functions associated with the 16 general pur- 
pose registers of the HEX-29. It is most significant that all 16 
general purpose registers have alternate functions. This should 
not imply that they are not true general purpose registers how- 
ever. Any register can be used as an accumulator, stack pointer, 
index register, memory pointer, data counter, etc., in most in- 
structions. To increase coding efficiency and execution speed, 
however, some instructions use the defined register assignments 
in Table 1. 


TABLE 1. 


Alternate 
Function 


Alternate 


Program Counter 
Stack Pointer 
Data Passing 

Y Index Register 
X Index Register 
Accumulator 
Counter 
Scratchpad 

FP1 (LSW) 
FP1 

FP1 (MSW) 
FP1 (EXP) 
FPO (LSW) 
FPO 

FPO (MSW) 
FPO (EXP) 


Notes FP1 = Floating Point Register 1 
FPO = Floating Poifit Register 0 
DW1 = Double Word Register 1 
DWO0 = Double Word Register 0 





For example, the instruction set of the HEX-29 CPU can load 
immediate, push, pop, and move indexed and direct any of the 
multiple register combinations (FP1, FPO, DW1, DWO) in one 
instruction. One mode of indexed addressing and many byte 
processing instructions benefit greatly from the alternate use of 
some registers. 


Condition Code Register 


The condition code register contains all zeros in its upper byte. 
The bit assignments in the low byte are shown in Table 2 


TABLE 2. CONDITION CODE REGISTER BITS. 


Function 
User Flag #2 


User Flag #1 
User Flag #0 


Position 


Half Sign Flag (Bit 7, MSb of low byte) 
Zero Flag 

Negative Flag (MSb of result) 

2’s Complement overflow flag 

Carry Flag (arithmetic and shift carry) 





The user flags (U2, U1, U0) are an extra feature of the HEX-29 
CPU. They are not altered by any but five special flag modification 
instructions (SETF, CLRF, COMF, POPF, LDF) These op codes 
set, clear, complement, pop, or load the flags respectively. Since 
the user flags are immune to change except by these special. 
purpose flag altering instuctions, they are excellent for passing 
status information between routines 


The half sign flag (H) is setif the result of an operation contains a 1 
in the most significant bit of the low byte; otherwise It is cleared. 
This flag is useful in many byte processing and loop counting 
routines 


If the result of an operation Is zero, the zero flag (Z) is set, or else it 
is cleared This is the most useful of all the flags and is used on 
comparisons, arithmetic and logical operations, loop counting, 
etc... 


When the most significant bit of the result of an operation Is a logic 
1, the negative flag (N) is set. Otherwise it is cleared. Note that in 
two's complement notation, the most significant bit of a number 
determines the sign of the number If it is a logic 1, the number is 
negative, if it is a logic 0, the number ts positive. 


Ifthe two’s complement result of an arithmetic operation results in 
a two's complement overflow, the V flag is set. This flag is also 
used as a general error flag by the HEX-29 CPU For example, the 
V flag 1s set if a divide by zero instruction ts attempted. In floating 
point notation, if the exponent becomes too large or small, (arith- 
metic overflow/underflow), the V flag is set to so indicate. 


The carry flag (C) is used for two purposes It is a source and/or 
destination bit in shift and rotate instructions, and as a carry-out 
bit when an arithmetic function result is too large to fit in the 
appropriate destination register. The convention with regard to 
the carry flag on addition and subtraction follows 


C flag = 1 1f 1. Binary add results in a carry out. 
2. Binary subtract results in no borrow. 


C flag = Of 1. Binary add results in no carry out. 
2 Binary subtract results in a borrow. 
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All of the condition code flags, except the user flags, have some 
special meanings in some of the complex ‘macro’ instructions. 
These are described in the detailed section on the HEX-29 in- 
struction set. 


interrupt Registers 


There are three special purpose interrupt registers in the HEX-29 
CPU. They are: 


1. Mask Register 
2. Status Register 
3. Vector Register 


These registers are command driven, thats, the register selected 
is a function of the interrupt command being executed. More 
detailed information on the nature of these registers appears later 
in this application note. 


Memory Management Registers 


A sophisticated memory management structure is embodied in 
the HEX-29 CPU. Integral to this structure is the set of 16 memory 
map registers. These 8-bit registers contain transformation val- 
ues that allow multiple users and tasks to share the processing 
time of the CPU without interacting with each other. Each task 
logged onto the HEX-29 is unique from all others through its 
memory map image. When it is chosen to run on the CPU, its 
memory map image becomes synonomous with the CPU mem- 
ory map registers. More detailed information on this aspect of the 
HEX-29 CPU appears later in this application note. 


Instruction Formats 


The instruction formats of the HEX-29 CPU are simple and few in 
number For this reason, the HEX-29 instruction set is not difficult 
to learn and use, even though it Is very extensive and quite 
sophisticated. 


Emphasis on the use of 4-bit (hexadecimal), and 8-bit (byte) fields 
in the instruction formats simplify the organization of the instruc- 
tion set. All of the instruction formats used in the HEX-29 are 
shown in Figure 6. 
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Figure 6. Instruction Formats. 


Most Instructions involve operations on 16-bit words. However, 
the HEX-29 instruction set also includes op-codes that operate on 
the following data types 


1 1 Bit (Bit) 

2. 4 Bits (Hex or Nibble) 
3 8Bits (Byte) 

4. 16 Bits (Word) 

5 32 Bits (Double Word) 
6 64 Bits (Quad Word: Floating Point) 
7. N Bits (Variable Format) 





In addition to working on the fixed length data types, there are 
many ‘macro’ instructions that operate on variable length 
character, byte, and word strings in memory. These strings can 
be either contiguous in memory or in the form of linked lists. 
Several of these ‘macro’ instructions are highly optimized micro- 
coding of the most critical routines used in high level language 
processing. 


The multiplicity of data types processed efficiently by the HEX-29 
increases tts ability to meet the diverse demands of modern 
computing 


Addressing Modes and Assembly Language 


Much of the power and simplicity of the HEX-29 instruction set Is 
derived from the large number of useful addressing modes avail- 
able for the most used basic functions such as MOV, ADD, SUB, 
INC, DEC, CMP, etc. Addressing modes specify where operands 
of an instruction are to be found and where the result Is to be 
stored. 


The 16 general purpose 16-bit registers are designated RO, R1, 
R2...RC, RD, RE, RF. These are the primary names of the 16 
registers and refer directly to the corresponding registers. In other 
words, when ‘RD’ ts written in a HEX-29 A8sembly Language 
(HAL) program, the contents of this register are used as an 
operand or destination in the instruction. 


The use of a register as a pointer to memory ts called memory 
pointer addressing. The names MO, M1, M2,...MC, MD, ME, 
MF apply to the 16 general purpose registers when they are used 
as memory pointers 


When a register points to a memory location which contains the © 
address of the memory location holding the value of interest, the 
register is said to be an “indirect pointer”. The names 10, 11, 
i2,...1C, ID, IE, IF are used to specify the 16 general purpose 
registers when they are being used with this type of addressing. 


Indexed addressing Is possible using the names ZO, Z1, Z2,..., 
ZC, ZD, ZE. The use of one of these names means that the data is 
at the address formed by adding the contents of the register 
referenced to the contents of word following the instruction in 
main memory 


Most often, when a register is used as a memory pointer (MD for 
example), or as an indirect pointer (19 for example), it is extremely 
desirable that the register auto-increment or perhaps auto- 
decrement since programs, lists, and stacks are ordered in a 
positive direction through memory. 


In HEX-29 Assembly Language (HAL) it is quite simple to specify 
that a memory, indirect pointer register, etc. is auto-incremented 
or auto-decremented by appending a ‘+’ or ‘—’ character to the 
respective register specification. 


For example. 


MOV M7+, R6 The contents of memory pointed to by R7 Is 
moved into R6. R7 ts then incremented. 


MOV RA, ME— Decrement RE Then move the contents of RA 
into the memory location pointed to by RE. 


It is significant that auto-incrementing takes place after the oper- 
ation while auto-decrementing takes place before the operation; 
(auto-post-increment and auto-pre-decrement.) 


Several very fundamental addressing modes arise from auto- 
incrementing memory and indirect pointers. Consider the follow- 
ing examples: 


A. Program Counter (RF) as an auto-incrementing pointer yields 
‘immediate addressing’. ; 


MOV MF+, RA = Move immediate into RA. 
ADD MF+, R6 = Add immediate to R6. 
MOV MF+, RF = Jump to address in immediate word. 


B. Stack Pointer (RE) as an auto-incrementing pointer yields 
‘stack addressing’. 


MOV ME+, R2 = Pop top of stack into R2. 
XOR ME+, R1— = Pop top of stack and XOR into R1. 
MOV ME+, RF = Return from subroutine! 


C. General registers used as data stack pointers. 
ADD MD+, MD _ = Add top two members of data stack + 
leave result on top of the stack. 

CMP MD+, M8+ = Compare top members of two stacks + 
remove these values from the stacks. 
= AND immediate word with the top mem- 

ber of stack pointed to by R6. 


AND MF-+, M6 


D. Program Counter (RF) as an auto-incrementing indirect 
pointer yields ‘direct addressing’. 


MOV IF+, R7 = Move direct into R7. 
ADD IF+, RC = Add direct into RC. 
MOV IF+,!1F+ = Move direct to direct. 


It should be clear that these examples represent only a few of the 
most useful of many possible uses of auto-incrementing and 
auto-decrementing with memory and indirect pointers. Careful 
study of the HEX-29 instruction set will reveal many more uses 
not examined in these examples. 


Classes of Instructions 


The instruction set of the HEX-29 includes many different func- 
tions and a multitude of addressing modes. Nonetheless, all 
instructions fall into one of two classes of instructions. The gen- 
eral register class of instructions are extremely flexible because 
of the enormous number of variations inherent in each op-code. 
The defined register class of instructions permits extremely fast 
and memory efficient code for often used functions and register 
sets. The power of the HEX-29 instruction set is derived from an 
extensive combination of the most powerful and efficient instruc- 
tions from each class. 


General Register Instructions 


In a general register instruction, the function and addressing 
mode are specified in the op-code field (upper byte). The lower 
byte then holds two 4-bit (hex) values that specify the registers 
used in the instruction. It should be clear, therefore, that for every 
general register instruction there are 256 possible specific ac- 
tions that can be performed. 


The full power of these instructions may not be evident without an 
example. A discussion of just 5 of the 256 possible variations on 
the MOV M4, R instruction will demonstrate the extreme flexi- 
bility of each and every general register instruction. Execution of 
the MOV M4, R instruction proceeds as follows: 


1. Contents of Rs are moved to the address bus. 
2. Rs is auto-incremented by one. 
3. The data addressed by Rs is loaded into Rd. 


In this instruction, Rs is used as an auto-incrementing mem- 
ory pointer, hence the M+ notation. Rs and Rd are the source 
on destination registers. Below is the set of 5 examples of 
how the one op-code can be used to implement a number of 
important functions. 


. MOV MF+, R3 (W) = Load immediate word (W) into R3. 
MOV MF+, RF (W) = Jump direct to address (W). 


APRON — 


MOV ME-+, R6 = Pop top of control stack into R6. 
MOV MD+, R4 = Load next member of list into R4. 
. MOV ME+, RF = Return from subroutine. 


Taking a few minutes to review this section and understand how 
all of these functions are achieved with the single MOV M+, R 
op-code should reveal the nature of the power and flexibility of the 
general register class of instructions. 


Defined Register instructions 


A defined register instruction is an instruction whose function, 
addressing mode, and register assignments are all defined in the 
op-code field (upper byte). The low byte is then available for use 
as an offset for short relative branching instructions, an im- 
mediate byte or character, an 8-bit index value, an 8-bit logical 
mask, etc. With this class of instructions, often only one word is 
required for the entire instruction. This speeds execution and 
improves coding efficiency markedly in most applications. It is for 
this class of instruction that the alternate register function as- 
signments appear in the model of the HEX-29 register set. An 
example of a one word defined register instruction with the two 
word instruction it can replace follows: 


ADC X,A 26 Defined Register Instruction 
ADC ZB, RA 0026 General Register Instruction 


Both of the instructions accomplish the same thing. In both cases 
RB (X index register) is used as an index register and RA (Ac- 
cumulator) is the destination operand. The value in memory atthe 
address pointed to by the sum of the X index register (RB) plus the 
hex constant 26 ts added to the contents of the Accumulator (RA) 
and the sum left in Accumulator (RA). 


The significant difference between the two instructions is that the 
defined register instruction takes only half as much code (one 
word vs. two), and executes faster since there are fewer memory 
accesses and fewer machine cycles. Very often the defined reg- 
ister instruction will be adequate for the job. But when the 
choices of registers RB and RA are not acceptable or if an 8-bit 
index offset is not large enough, the general register instruction 
would be the proper choice. It allows any register pair to be 
specified as the index register and destination/accumulator and 
has a 16-bit index offset in the word following the instruction. 


As mentioned earlier, it is largely the ability of mixing defined and 
general purpose instructions freely that makes programs written 
in HEX-29 Assembly Language very code efficient and fast. 


HEX-29 Instruction Set 


The HEX-29 instruction set is quite extensive. It not only of- 
fers all of the basic functions in a wide variety of addressing 
modes, it also includes a multitude of special purpose instruc- 
tions. These special purpose instructions cover many impor- 
tant aspects of programming including program control, 
numeric processing, string manipulation and searching, list 
processing, etc. 


Fortunately, all of these types of instructions fall into one of 
only four different instruction formats. These were shown in 
Figure 6. Table 3 shows all of the instructions for the HEX-29 
machine. 
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TABLE 3. SUMMARY OF MNEMONICS ARITHMETIC OPERATIONS. 


ADC 
ADD 
ADDB 
ADDH 
DADD 
FADD 


SBB 


SUBB 
SUBH 
RSUB 
DSUB 
FSUB 


UMUL 
SMUL 
DMUL 
FMUL 


UDIV 
SDIV 
DDIV 
FDIV 


CMP 
CMPB 
CMPBA 
CMPH 
CMPHN 
CMPHA 
DCMP 
FCMP 


NEG 
DNEG 
FNRM 
DTST 
FIST 


INC 
DEC 


Add words plus carry 

Add words w/o carry 

Add byte to word 

Add hex value (nibble) to word 

Add double word values (32 bits + 32 bits — 32 bits) 

Add floating point values (64-bit FP + 64-bit FP — 64-bit FP) 


Subtract with borrow 

Subtract w/o borrow 

Subtract byte from word 

Subtract hex value from word 

Subtract words in reverse order 

Subtract double word values (32 bits — 32 bits — 32 bits) 
Subtract floating point values (64-bit FP — 64-bit FP — 64-bit FP) 


Unsigned word multiply (16 bits * 16 bits — 32 bits) 
Signed word multiply (16 bits * 16 bits — 32 bits) 

Double word signed multiply (32 bits * 32 bits — 64 bits) 
Floating point multiply (64-bit FP * 64-bit FP — 64-bit FP) 


Unsigned word divide (16 bits + 16 bits > 16 bits + 16-bit remainder) 
Signed word divide (16 bits + 16 bits > 16 bits + 16-bit remainder) 
Double word signed divide (32 bits + 32 bits > 32-bit + 32-bit remainder) 
Floating point divide (64-bit FP + 64-bit FP — 64-bit FP) 


Compare words 

Compare byte with word 

Compare byte with byte 

Compare positive hex value (nibble) with a word 
Compare negative hex value (nibble) with a word 
Compare hex value (nibble) with another nibble 
Compare signed double word values 

Compare floating point values 


Negate word (2’s complement) 

Negate signed double word value 

Normalize floating point number 

Test signed double word value for zero + sign 
Test floating point value for zero + sign 


Increment word by one 
Decrement word by one 


Shifts & Rotates 


ASR 
ASL 

CSL 
DSL 
DSR 
LSR 

RCL 
ROL 
ROR 
VSL 

VSR 


Arithmetic shift right 

Arithmetic shift left 

Count and shift left (until MSb=1) 
Double word shift left 

Double word shift right 

Logical shift right 

Rotate closed left 

Rotate left (through carry flag) 
Rotate right (through carry flag) 
Variable shift left (0 to 15 places) 
Variable shift right (0 to 15 places) 
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TABLE 3. SUMMARY OF MNEMONICS ARITHMETIC OPERATIONS. (Cont.) 


Logical Operations 


AND 
ANDB 
lIOR 
IORB 
XOR 
XORB 
COM 
CLR2 
BTS 
BTC 
BTl 
BTT 
CLRF 
SETF 
COMF 


Boolean AND words 

Boolean AND byte with word 
Boolean inclusive OR words 
Boolean inclusive OR byte with word 
Boolean exclusive OR words 
Boolean exclusive OR byte with word 
Complement word 

Clear the specified 2 registers 

Bit set 

Bit clear 

Bit invert 

Bit test 

Clear specified flags 

Set specified flags 

Complement specified flags 


Data Movement 


MVN 
MOV 
MVM 
MVB 
LDB 
STB 
MVH 
MVHN 
LDI2 
XCH 
DXCH 
FXCH 
XCHM 
DUP 
SWT 
JAM 
SWP 
PSH2 
POP2 
PSHF 
POPF 
PSH8 
POP8 
PSHD 
POPD 
LDINT 
RDINT 
RMM 
LMM 
FMM 
BMBF 
BMBR 
BMWF 
BMWR 


Move, no flags altered 

Move, update flags 

Move multiple words 

Move a byte 

Load a byte 

Store a byte 

Move a positive nibble 

Move a negative nibble 

Load immediate 2 registers 

Exchange contents of two registers 

Exchange contents of DW1 and DWO 

Exchange contents of FP1 and FPO 

Exchange top two members of any stack 

Duplicate top member of any stack 

‘Switch’. Store register indexed and reload indexed 
Move any bit field from one word to another 

Swap high and low bytes in a word 

Push any two registers onto control stack 

Pop top two words on control stack into two registers 
Push flags (condition code register) onto control stack 
Pop top of control stack into condition code register 
Push 8 registers onto control stack 

Pop 8 registers from control stack 

Push R8, R9, RA, RB, RC, RD onto control stack 
Pop R8, R9, RA, RBRC, RD from control stack 
Load interrupt register 

Read interrupt register 

Read a memory map location 

Load a memory map location 

Fill memory map 

Block move bytes forward in memory 

Block move bytes reverse in memory 

Block move words forward in memory 

Block move words reverse in memory 


PROGRAM CONTROL 


EXR 
RTI 
BPT 
JFS 
JFC 
CFS 
CFC 
JIFS 
JIFC 
CIFS 
CIFC 
RTFS 
RTFC 
JMP 
CALL 
CEX 
BGT 
BGE 
BLT 
BLE 
*TTWB 
*TTBB 
DBNZ 
BZD 
*CBB 
BR 
BSR 
BC 
BNC 
BV 
BNV 
BN 
BNN 
BZ 
BNZ 
BH 
BNH 


Execute contents of register as an instruction 
Return from interrupt 

Breakpoint trap 

Jump if specified flags are set 

Jump if specified flags are clear 

Call subroutine if specified flags are set 
Call subroutine if specified flags are clear 
Jump indirect if specified flags are set 
Jump indirect if specified flags are clear 
Call subroutine if specified flags are set 
Call subroutine if specified flags are clear 
Return from subroutine if specified flags are set 
Return from subtoutine if specified flags are clear 
Jump to the address specified 

Call subroutine , 

Call executive (software interrupt) 

Branch if greater than 

Branch if greater than or equal 

Branch if less than 

Branch if less than or equal 

Transition table word branch 

Transition table byte branch 

Decrement and Branch Non-Zero 

Branch on zero or decrement 

Compare and branch if in bounds 

Branch 

Branch to subroutine 

Branch if carry flag set 

Branch if carry flag not set 

Branch if overflow flag set 

Branch if overflow flag not set 

Branch if negative flag set 

Branch if negative flag not set 

Branch if zero flag set 

Branch if zero flag not set 

Branch if half sign flag set 

Branch if half sign flag not set 


Miscellaneous Instructions 


NOP 
*SCNB 


*SCNW 


*SEAF 
*SEAL 


No operation for 2 to 256 cycles 

Scan for match with specified byte 

Scan for match with specified word 

Basic fixed entry length list search 

Basic variable entry length linked list search 


*These ‘macro’ instructions are examined in more detail on the following pages. 
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UMUL 


SMUL 


UDIV 


SDIV 


DADD 


DSUB 


DMUL 


SUMMARY OF SELECTED ’MACRO’ INSTRUCTIONS 


Unsigned 16-bit muitiply 

16 bits 16 bits — 32-bit answer 

R3  R2—R3 (MSW of answer) 
— R2_ (LSW of answer) 
— Ri (LSW of answer) 
— RO (MSW of answer) 


lf V=1 then RO is not zero (Answer is longer than 16 bits) 
lf N=1 then MSb of RO = 1. (No particular significance) 
lf Z=1 then answer is zero (R1 and RO are cleared) 


Signed 16-bit multiply (Two’s complement notation) 
16 bits 16 bits — 32-bit answer 
R3 R2-—-R3_ (MSW of answer) 
— R2 (LSW of answer) 
— Ri (LSW of answer) 
— RO (MSW of answer) 
lf V=1 then answer is longer than 16 bits (overflowed LSW) 
lf N=1 then answer is negative 
lf Z=1 then answer is zero (R1 and RO are cleared) 


Unsigned 16-bit divide 

16 bits / 16 bits — 16-bit answer and 16-bit remainder 

R3 / R2 — R2 R3 holds remainder 

If V=1 then an attempt to divide by zero was refused 

lf N=1 then MSB of answer = 1 

If Z=1 then answer is zero (R2 = 0 R32 need not be zero) 


Signed 16-bit divide (Two’s complement notation) 

16 bits / 16 bits — 16-bit answer and 16-bit remainder 

R3 / R2 — R2 R3 holds remainder 

If V=1 then an attempt to divide by zero was refused, or overflow 
If N=1 then the answer is negative 

lf Z=1 then the answer is zero (R2 = 0 R3 need not be zero 
R3 has sign of numerator 


Double word signed add 

32 bits + 32 bits — 32 bits 

DW1 + DWO — DWO ie. R3,R2 + R1,RO — R1,RO0 
The C flag is treated the same as in Rngle word addition 

If V=1 then a two’s complement overflow occurred 

lf N=1 then the answer is negative 

lf Z=1 then the answer is zero 


Double word signed subtract (Two’s complement notation) 

32 bits — 32 bits — 32 bits 

DW1 — DW0 — DWO0 ie. R3,R2 — R1,RO — R1,RO 

The C flag is treated the same as in single word subtract 

IF V=1 then a two’s complement overflow occurred 

If N=1 then the answer is negative 

lf Z=1 then the answer is zero 

If one divides “8000” by ‘‘FFFF’’ (—32768 + —1) the answer is ‘‘8000’’ (+.32768). However, 8000 is a negative 
number in two’s complement, so an overflow has occurred 


Double word signed multiply 
32 bits = 32 bits — 64 bits 
DW1 DWO0— DWO,DW1 ie. R3,R2 R1,RO — R1,R0,R3,R 
NOTE: The order of the answer words is as follows: 
MSW —> R2 
MSW-1— R3 
MSW -—2-—_ RO 
MSW-3— Ri (LSW) 


DDIV 


DCMP 


DXCH 


DNEG 


TST DWO 


FPADD 


FPSUB 
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SUMMARY OF SELECTED ’MACRO’ INSTRUCTIONS (Cont.) 


The reason for this seemingly unnecessary odd order concerns the results that are desired in DWO (RO,R1) at the end 
of the operation. The desired result of 32-bit math operations are nearly always 32-bit answers. However, a 32- 

bit * 32-bit multiply can generate up to 64 bits. Therefore, the least significant 32 bits of the answer are stored in 
DWO where the answer is expected on all double word (DW) instructions The most significant 32 bits must be stored 
in DW1, therefore the seemingly reversed order of storage. If the V flag = 0 at the completion of an operation, then 
only the 32 bits in DWO are significant and the user program can store this 32-bit double word without fear of losing 
significant bits. So, in the normal situation where only the least significant 32 bits of the answer is desired and the 
more significant 32 bits of the answer does not contain significant bits, the answer is where the normal convention 
specifies, in DWO If the V flag !s found set and it is desirable to save the 64-bit result rather than go to an error 
routine, a simple DXCH will exchange the contents of DW1 and DWO and leave the 64-bit answer in a logical order 
with the MSW in RO and the LSW in R3. It can then be buffered with any of the floating point register 0 buffer 
instructions. If V=1 then the answer has greater than 32 bits of significance. 

lf N=1 then the answer ts negative 

lf Z=1 then the answer is zero 


Double word signed divide (Two’s complement notation) 

32 bits / 32 bits — 32-bit answer and 32-bit remainder 

DW1 / DWO — DWO Remainder — DW1 

lf V=1 then attempted divide by zero was refused, or overflow 
lf N=1 then answer is negative 

lf Z=1 then answer is zero (DWO = 0. DW1 not tested) 


Double word compare (Two’s complement notation) 

32 bits — 32 bits — Nowhere (Update V,N,Z flags) 

DW1 — DWO — Nowhere 

The C flag is treated the same as in a single word compare 
lf V=1 then a two’s complement overflow occurred 

lf N=1 then the difference is a negative value 

lf Z=1 then the difference is zero 


Double word exchange 

Operates on any contents of DW1 and DWO 

DW1 — TEMP DWO — DW1 TEMP — DWO 
DW1 = R3 and R2 

DWO = Ri and RO 

No flags are altered 


Double word negate (Two’s complement notation) 

0000 0000 — 32 bits — 32 bits 

0000 0000 — DWO0 — DWO 

lf V=1 then a two’s complement overflow occurred DWO = 8 0000 0000 
lf N=1 then the final value in DWO ts negative 

lf Z=1 then the final value in DWO is zero 


Double word test value (Two’s complement notation) 

Set flags based upon the contents of DWO 

0000 0000 + DWO — Nowhere (Update V,N,Z) 

If V=1 then a valid 2’s complement value overflows the LSW 
If N=1 then the value in DWO Is negative 

lf Z=1 then the value in DWO ts zero 


Floating point add Double Precision (64 bits) 

Standard HEX-29 floating point format 

FP1 + FPO — FPO 

lf V=1 then an overflow in the 2’s complement exponent occurred 
lf N=1 then the answer is negative 

lf Z=1 then the answer is zero 


Floating point subtract Double Precision (64 bits) 

Standard HEX-29 floating point format 

FP1 — FPO — FPO 

If V=1 then an overflow in the 2’s complement exponent occurred 
lf N=1 then the answer is negative 

lf Z=1 then the answer is zero 
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FPMUL 


FPDIV 


FPCMP 


FPNRM 


FPXCH 


TST FPO 


SEAL 


SUMMARY OF SELECTED ’MACRO’ INSTRUCTIONS (Cont.) 


Floating point multiply Double Precision (64 bits) 

Standard HEX-29 floating point format 

FP1 FPO — FPO 

lf V=1 then an overflow in the 2’s complement exponent occurred 
lf N=1 then the answer is negative 

lf Z=1 then the answer is zero 


Floating point divide Double Precision (64 bits) 

Standard HEX-29 floating point format 

FP1 / FPO — FPO 

lf V=1 then an overflow in the 2’s complement exponent occurred or negative zero refused. 
If N=1 then the answer is negative 

lf Z=1 then the answer is zero 


Floating point compare Double Precision (64 bits) 
Standard HEX-29 floating point format 

Compare the magnitudes of FP1 and FPO 

If N XOR V = 1, then FP1 < FPO 

lf Z2=1 then FP1 = FPO 


NOTE: WE HAVE TO FURTHER DEFINE THE WAY THIS WORKS, BUT THIS INSTRUCTION WILL SET THE FLAGS 
SUCH THAT THE 2’s COMPLEMENT BRANCH ON THE EF PAGE WILL WORK!!! 


Floating point normalize Double Precision (64 bits) 

Standard HEX-29 floating point format 

The sign of the mantissa must be in the MSb of the exponent word before this instruction is executed 

Shift mantissa left and increment exponent until MSb of the MSW of the mantissa is one. (Operates on FPO only) 
lf V=1 there was a 2’s complement overflow of the exponent 

The C flag is trashed 

N=1 result is negative 

Z=1 result is zero 


Floating point exchange Double Precision (64 bits) 
Operates on any contents of FP1 & FPO (R7 thru RO) 
FP1 — TEMP FPO — FP1 TEMP — FPO 

FP1 = R7, R6, R5, R4 

FPO = R3, R2, R1, RO 

No flags are altered 


Floating point test Double Precision (64 bit) 
Standard HEX-29 floating point format 

Set the flags based upon the contents of FPO 

If N=1 then the value in FPO is negative 

If Z=1 then the value in FPO is zero 


BASIC string variable / numeric or string matrix element search 

The SEAL instruction provides a very flexible way to rapidly and efficiently search linked lists for a particular entry. 
In each entry in the list, the first two 16-bit words are ordered as follows: The first word of each entry is the link 
offset to the next entry in the linked list. The second word is the entry name word. Any 16-bit value can be used in 
this field. 


The name of the entry to be searched for must be put in the accumulator (RA) before this instruction is executed. The 
format of the instruction is as follows: 


SEAL F,Md_ where F is the literal binary value 1111. 


The destination field of the instruction (Md) specifies the register that must point to the beginning of the linked list. 
Starting at.this point, this instruction will link its way thru the list looking for a match between the word after the link 
offset word (the entry name) and the contents of the accumulator (RA). 


At the completion of the instruction, the Z flag indicates the results of the instruction in the following manner: 


Z=1 No match was found in list (End of list reached) 
Z=0 A match was found and Md is pointing to the word after the entry name that matched the accumulator 





SEAF 


SCNW 
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SUMMARY OF SELECTED ’MACRO’ INSTRUCTIONS (Cont.) 


Since the link offset word is a two’s complement value, it can link to any other location in memory. The link offset is 
equal to the difference between the address of the next link offset word and the address of the current link offset 
word, minus one. 


Note that this instruction can be used to search linked lists with entry names that are much longer than 16 bits with 
ease. For example, tf the entry names to be matched are 2 words long, all that need be done Is to compare the word 
at which the pointer is aimed with the second word of the desired variable name. If it matches, then the pointer now 
points to the first element in the list after the double word entry name. If it does not match, the search can be 
continued by backing up the pointer to the link offset of the current entry and re-executing the SEAL instruction. 


At the completion of the instruction, the contents of the register specified by the Md field in the instruction will 
contain the address of the word AFTER the variable name in the list entry that matched the one in the accumulator 
(RA). At the completion of the instruction the Z flag will indicate the results of the instruction execution. If the Z flag 
is at a zero level, the search was successful and the pointer to the table (Md) contains the appropriate value. On the 
other hand, if the Z flag is set to a one level, no match to the variable name in the accumulator was found anywhere 
in the linked list. 


LO VN da da... da da LO VN da... da da LO VN... 


LO = Link Offset word 
VN = Variable Name word 
da = data entries irrelevant to instruction 


Basic fixed link offset variable search 


The SEAF instruction provides a very flexible way to rapidly and efficiently search lists for a particular entry. It Is 
Slightly different from the SEAF instruction in that the link offset word is not imbedded in the list entries. Instead, this 
instruction assumes that all list entries are of the same length (even though the internal formats may vary). The value 
of the link offset is the immediate word following the BASF op code word. 


Perhaps the most obvious use of this instruction is for searching a numeric variable list for a specific variable name 
followed by the value. The lists entries can be any length, so single and double word integers and floating point lists 
can all be handled with equal ease, but not all with the same instruction since the list entries will not be the same 
length for all of these. 


The link offset word following the instruction is a two’s complement number. Therefore, any fixed length can be 
searched forwards or backwards in memory. The link offset constant equals the number of words in each list entry, 
or its 2’s complement for a backwards search. 


Again the variable name word to be searched for must be put into the accumulator (RA) before the BASF instruction 
is executed. And the contents of the destination field register (specified by Md) points to the first element of the list. 
The form of this instruction is shown below: 


SEAF 0,Md where 0 = binary 0000 

scan for word 

The SCNW instruction is of the following form: 
SCNW Ms,Md 


This instruction scans a table of words (pointed to by Rs) for a match with the contents of the accumulator. Each 
time a word is fetched from the table, Rd is incremented. If Rd contains zero at the beginning of the instruction, then 
it will contain the number of the words searched in the source table before a match with the accumulator occurred. 


Alternatively Rd may contain a pointer to another table. When a match between the accumulator and the source table 
occurs, Rd will point to a corresponding entry in the ‘destination’ table. 


If the source list pointer and the destination list pointer are the same, then the two tables are interleaved; ie. the 
combined list would start: 


Source list word #1 
Destination list word #1 
Source list word #2 
Destination list word #2 
Source list word #3 
etc. 
etc. 
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SUMMARY OF SELECTED ’MACRO’ INSTRUCTIONS (Cont.) 


This instruction can be very useful in command processing routines and for searching lists that are not linked within 
the list itself (see BASS and BASF). 


The last entry in the source list must be a zero. If no matches were found previous to this zero word, then the Z flag 
is set. If the Z flag was not set, then a match was found and the pointers are valid. This instruction is interruptable 
on a word by word basis. 


Scan for byte 


The scan for byte instruction (SCNB) works identically to the scan for word instruction except that the source list 
contains bytes packed into words. Thus the source list is only half as long as the destination list (if there is one). 


Note that both lists must start on word boundaries. Only the low byte of the accumulator is used in the compare with 
the source bytes. The contents of the accumulator are not affected by the instruction. This instruction is interruptable 
on every other byte that is compared. The Z flag has the same meaning as for the SCNW instruction. 


Instruction Matrix 


A convenient way to present all of the basic op-codes of the 
HEX-29 CPU is by way of an ‘instruction matrix’. The eight-bit 
op-code in the upper byte is broken into two nibbles. The most 
significant nibble of the op-code appears on the left side of the 
matrix shown in Figure 7. The lower nibble appears along the top 
row. The second matrix shown in Figure 8, is called the ‘extended 
function’ matrix. In the HEX-29 CPU, the low byte of the instruc- 
tion word is interpreted as an ‘extended function’ op-code if the 
upper byte ts an ‘EF’ hex. 


Memory Management 


The HEX-29 incorporates a sophisticated memory management 
structure. Though very clean and elegant in implementation, the 
capabilities of the processor are greatly extended by this circuitry. 
Transparent to the user not requiring its many features, this 
structure is vital to many very important applications; most sig- 
nificantly the support of multi-user, multi-task, time-sharing 
operations. 


To all programs executing on the HEX-29, all memory addresses 
are 16 bits long. But before these 16 lines reach the system bus, 
they pass through the memory management section of the 
HEX-29 CPU. In this circuitry, the most significant four bits 
(A15-A12) are ‘mapped’ into eight bits on the bus (a ‘write-protect’ 
bit (WP) and seven address lines (A18-A12)). The net increase of 
three address bits expands the total addressable memory space 
to 512k words or 1 Megabyte. The WP bit is used to write protect 
the memory in blocks as desired by the executive program. 


Since each of the 16 locations in the memory map represents a 4k 
word block (or page), up to 64k words can be addressed by a 
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program at any time. Any location in the memory may contain any 
8-bit value, so memory that is contiguous to a program need not 
be contiguous in physical memory. For clarity, Figure 9 shows 
schematically how this ‘memory mapping’ works. 


The low 4k words of physical address space is reserved for the 
nucleus of an operating system; also called an executive or 
supervisor program. This is called physical page zero. The con- 
tents of the memory map can only be altered if the low location of 
the memory map contains all zeros. Since this is synonymous 
with the physical page zero address block, only the executive 
program is able to change the contents of the memory map. And 
since all I/O devices and channel control blocks are located in 
physical page zero, all |/O must also be done through the execu- 
tive program. Likewise, all hardware and software interrupts in- 
voke the supervisor automatically. 7 


Because of this simple but fool-proof security scheme, complete 
protection of all users memory space and I/O devices can easily 
be maintained by the executive program. 


Also note that the supervisor program can safely make programs 
that are re-entrant available to several users simultaneously as 
long as it write protects the code. Since user programs are often 
no larger than the host program under which it is running, this 
technique can result in a savings of 30% to 50% in system 
memory usage. 


Occasionally, for special purposes, a single user may wish sole 
access to the entire resources of the system. Examples would 
include programs too large to run ina single user's 128k bytes of 
memory. Or perhaps a new I/O access method. In any case, it is 
possible for a single user on the system to gain complete control 
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Figure 7. HEX-29 Instructions. 
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Figure 8. HEX-29 EF Instructions. 
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Figure 9. Memory Mapping Program Address (Logical Address). 


and access to the system by assigning himself as the executive 
program. This can only be accomplished after a system reset. 
Hence only those with physical access to the computer (and who 
have a reset key) can accomplish this operation. This user is then 
empowered with all of the features and capabilities of the 
machine with no limitations. Direct access to ail of the system I/O 
devices, the entire interrupt structure, the memory map, etc., is 
then at the command of the single user in the executive or 
supervisor mode. 


Most often, each user needs only one or two 4k pages of memory 
in addition to the host program which is probably shared. Thus it 
would be very wasteful if each user were to have access to a full 
65k words of physical memory space. For this reason, a page of 
physical memory has a special designation in the system. 


The highest possible physical address block when write pro- 
tected is called the ‘invalid access’ block. Whenever a user ac- 
cesses memory that the supervisor has mapped into the invalid 
access block, the processor ‘traps’ to a special location in the 
supervisor program called the ‘invalid access trap’. This occurs 


before the current machine cycle is completed. This is treated 
identically to an interrupt by the processor except that the current 
instruction is not completed. 


Any number of actions can be taken by the supervisor at this time. 
This will usually depend upon the resources of the machine and 
the circumstances under which the problem arose. For example, 
the executive program could inform the program that its memory 
space had been exceeded, or perhaps just allocate another block 
of memory to that user's memory map and continue the execution 
of the offending program. A more detailed discussion of the 
sequence of events that takes place upon an invalid access 
appears in the section on the interrupt structure of the HEX-29 
CPU. 


The highest physical address page, when not write protected, is 
called the ‘dead page’. No action of any kind takes place in this 
block and there is no memory there for the program to reference. 
Any number of pages from any number of users may be assigned 
to this physical page without fear of interaction. This is the block 
that will normally be assigned by the executive program to all user 
areas that are not needed or are not to be used. 





Interrupt Structure 


The HEX-29 CPU contains a powerful interrupt structure. As with 
memory management, this aspect of the CPU operation is largely 
transparent to users of the system. In most applications the HEX 
OPERATING SYSTEM FOR TIMESHARING (HOST) program 
services all interrupts. Nonetheless, it is useful to know the basic 
structure of the interrupt system. The three types of interrupts 
serviced by the HEX-29 CPU are examined in the following 
paragraphs. 


The hardware interrupts are caused by signals from physical 
devices outside of the processor. These signals, generated by 
peripherals, their controllers, or the real time clock, serve to notify 
the CPU of some condition or requirement of the interrupting 
device. 


The HEX-29 CPU has eight hardware interrupts. They are indi- 
vidually maskable and are prioritized into eight levels. Each prior- 
ity level has its own vector associated with it. In other words, each 
interrupt level has a corresponding memory location through 
which program control is passed upon that level interrupt. These 
memory locations are within the defined executive page (physical 
page 0) and thus all interrupts cause the HEX-29 to switch into 
executive mode automatically. The eight hardware interrupt 
levels and the associated memory locations are shown below. 


Hardware Interrupt Level 
Highest Priority 7 


Memory Location of Vector 


Lowest Priority 


So, for example, when an interrupt occurs on level 3, the HEX-29 
CPU will enter supervisor mode, save the users PC and SP, and 
cail the appropriate service routine at the address stored in mem- 
ory location 0403,,. 


Normally, each hardware interrupt level is reserved for a class of 
devices such as hard disc controllers, floppy disc controllers, 
serial channels, etc. If, for example, there are eight serial devices 
that are interrupting on level 0, the service routine is required to 
locate the one (or more) devices that are requesting service on 
that interrupt level and processes them accordingly. This could be 
done by polling all the serial devices whenever the interrupt was 
received. A more efficient technique, used in the HEX-29 system, 
is to further prioritize the like devices on a given interrupt level. 
Then when an interrupt occurs, a vector is read by the executive 
program that instantly informs it of the highest priority device 
requesting service on that level. When that device is serviced, the 
vector is read again to locate any other devices in need of service 
(if any), and finally resumes normal program execution when all 
devices are serviced. 


A software interrupt is an instruction that, when executed, causes 
an interrupt to occur. The mnemonic used for this op-code in the 
HEX-29 CPU is ‘CEX’, which stands for ‘call executive’. This 
instruction passes an 8-bit vector to the ‘HOST’ operating system 
which is used to determine the action requested by the program 
executing the CEX. Except that this interrupt is caused by a 
program rather than a physical device, the CEX operates in the 
same manner as a hardware interrupt. It vectors through memory 





location 040C. A pseudo software interrupt is the breakpoint 
‘BPT’ instruction which vectors through memory location 040B. 
The BPT instruction does not pass an 8-bit vector to the executive 
and is thus useful in program debugging. 


The third type of interrupt is called a ‘trap’. A trap takes place 
when certain conditions occur that require the processor's im- 
mediate attention. For example, if the program currently running 
on the CPU tries to execute an op-code for which there is no 
defined instruction, an ‘invalid instruction trap’ occurs. This is 
essentially a service to notify a user that his program was defec- 
tive and that an attempt was made to execute an op-code which 
has no meaning. These locations are left blank in the instruction 
matrix since they can subsequently be defined as new instruc- 
tions. This ‘trap’ vectors through memory address 040D and acts 
identically to all other interrupts. The only other trap in the HEX-29 
CPU is the ‘invalid memory access’ condition. This is discussed in 
more detail in the previous section on memory management. The 
‘invalid memory access’ trap vectors through memory address 
0408. 


Table 4 shows the memory locations that are defined in the 
HEX-29 for interrupt handling. 


TABLE 4. INTERRUPT MEMORY LOCATIONS. 


Memory 
Location System Defined Uses 


Reserved 

Reserved 

Vector for invalid instruction trap 
Vector for call executive (CEX) instruction 
Vector for breakpoint (BPT) instruction 
Temperature storage for user stack pointer 
Temperature storage for executive stack 
pointer 

Vector for invalid memory access trap 
Vector for hardware interrupt level 7 
Vector for hardware interrupt level 6 
Vector for hardware interrupt level 5 
Vector for hardware interrupt level 4 
Vector for hardware interrput level 3 
Vector for hardware interrupt level 2 
Vector for hardware interrupt level 1 
Vector for hardware interrupt level 0 






















Again, note that all interrupts are processed identically so that the 
one return from interrupt (RTI) instruction properly terminates all 
interrupt service routines. 


DMA/REFRESH CONTROL 


In order that an efficient multi-user or multi-task system be im- 
plemented, it is necessary that the processor not be burdened 
with the relatively slow transfer of programs and data between 
system memory and mass storage devices such as floppy and 
hard disks. For this reason, the controllers for these devices are 
designed with a high degree of intelligence and self-reliance. 
These controllers take virtually all of the burden of mass storage 
transfers upon themselves. This frees the HEX-29 CPU to exe- 
cute programs for all users not waiting for these mass storage 
transfers to take place. Because these controllers are essentially 
separate special purpose microprogrammed CPUs, they are 
often called ‘peripheral processors’, ‘channel processors’, or just 
‘channels’. 
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For this scheme to be effective, both the CPU and the channel 
processors must be accessing system memory concurrently. 
Fortunately, the inherent structure and operation of the HEX-29 
CPU is eminently suited to this requirement. 


In every instruction there is at least one machine cycle during 
which the HEX-29 CPU is decoding or internally executing an 
instruction. During these machine cycles the CPU does not use 
the system bus; the system bus and memory are available for 
access by devices other than the HEX-29 CPU. This is called a 
‘Free DMA cycle’ or ‘bus available’ cycle. During these machine 
cycles a channel processor may read or write memory without 
interfering with, or assistance from the HEX-29 CPU. The act of 
accessing system memory by any device other than the CPU is 
called ‘direct memory access’ or DMA since the channel pro- 
cessor is directly accessing system memory without CPU assis- 
tance or intervention. 


Resident in the HEX-29 CPU is a very clean, very powerful 
multi-level prioritized DMA structure. Within this structure up to 
ten groups of devices can share the system bus on a priority 
basis. Normally the priority levels are assigned on the basis of 
transfer speeds . . . the faster the device is able to support mem- 
ory transfers, the higher the priority it is assigned. In this manner 
several channel processors can access system memory concur- 
rently at the intervals they require. The DMA structure of the 
HEX-29 CPU can support very high combined transfer rates with 
multiple DMA devices using this technique. With high speed 
memory, the HEX-29 CPU need not even slow down its program 
execution to support a concurrent combined DMA transfer rate of 
4 Megabytes per second. With slower memory, this figure drops 
to about 2 to 3 Megabytes per second. Even this slower rate 
corresponds to concurrent DMA by one high speed hard disk plus 
several floppy disks plus room to spare. Still, the CPU can be 
halted, if necessary, to achieve combined DMA rates of up to 12 
Megabytes per second maximum. 


The support of dynamic memory in the HEX-29 system is simpli- 
fied by signals associated with this DMA structure. Whenever 
there are no devices requesting the bus for DMA, a signal on the 
bus indicates this condition. Dynamic memory refresh controllers 
can take advantage of these unused free DMA cycles to refresh 
internal dynamic RAM chips if desired. Even when very heavy 
use of the bus by DMA devices occurs, it is unlikely that too few of 
these unused free DMA cycles will be available for the dynamic 
memory refresh controllers. In this event, however, another signal 
can be used to disable all other DMA priorities and allow the 
refresh controllers as much time as is required. 


SYSTEM BUS AND TIMING 


When specifying the bus signals and their timing relationships 
during the early design stage of the HEX-29 CPU, utmost atten- 
tion was paid to simplicity and reliability. The result is that there 
are very few signals required to interface to the bus properly, and 
the timing requirements are quite straight forward and easy to 
meet. 


The following section is a description of the mnemonic names and 
functions of the HEX-29 system bus signals: 


System Bus 


A18-A0 
(Address Bus) 


Three-state outputs. A18-A0 are the 19 physical 
address lines of the HEX-29 system address 
bus. A18 is the most significant bit, AO is the 
least significant bit. These outputs are three- 
stated whenever the bus is available (BA is low). 


D15-D0 
(Data Bus) 


WP (also WE) 
(Write Protect) 


R/W 
(Read/Write) 


VMA 
(Valid Memory 
Access) 


CLK 


SDMA 


BA 
(Bus Available) 


STR 
(Stretch Clock) 


CLR 
(Clear) 


17-10 
(Interrupts) 


Three-state and bi-directional input/outputs. 
D15-D0O are the 16 lines that make up the 
HEX-29 system data bus. D15 is the most sig- 
nificant bit, DO is the least significant bit. 


Three-state output. WP is used to protect 
areas of memory from being written. Practically 
speaking this signal is active-LOW and would 
have been called WE (Write Enable) if not for 
possible confusion with the read/write signal 
which also must be LOW to write memory. 


Three-state output. The R/W signal determines 
whether a read or write operation is performed. A 
LOW level of the R/W line indicates a write 
memory is to be performed if VMA (valid memory 
access) is also LOW when the system clock 
(CLK) goes LOW. 


Three-state output. VMA is LOW during all 
cycles that a memory access (read or write) 
will be performed by the processor. 


Output, not three-state. CLK is the system 
clock. All timing in the HEX-29 system is defined 
relative to this signal. For convenience, the 
period of each machine cycle that the clock is 
high is called ¢, (phase 1) and the period that it 
is low is called @2g (phase 2). All system ‘chip 
selects’ are derived from this signal. 


Output, not three-state. SDMA is mnemonic 
for ‘synchronize direct memory access’. This 
bus signal is LOW the cycle before DMA is per- 
missible. The sole purpose of this signal is to 
notify DMA devices early of an upcoming ‘free 
DMA’ cycle. This will make it easier to ‘grab the 
bus’ very early in a ‘free DMA’ cycle to improve 
the address generation timing. 


Output, not three-state. BA is LOW on all 
cycles during which DMA is permitted by the 
CPU. When BA is LOW, all three-stateable out- 
puts from the HEX-29 CPU card are turned off 
and control is relinquished to DMA devices for 
the current cycle. BA is mnemonic for ‘bus avail- 
able’. 


Input to HEX-29 CPU. When an addressed de- 
vice is not fast enough to be reliably accessed 
(read or written) within the minimum access time 
of the HEX-29 CPU, it should pull the STR signal 
LOW. For each 40ns that STR is held LOW, the 
system clock is lengthened by 40ns and thus the 
access time required of the addressed device. 
This signal can be held LOW for as many as 
40ns increments as required to meet the access 
time of the addressed memory or I/O device. 


Output, not three-state. CLR is a LOW level 
pulse which is just a ‘cleaned up’ RESET signal. 
Any device that requires an initialization pulse 
should use this line. 


Inputs to HEX-29 CPU. 17-I0 are the eight 
hardware interrupt inputs. I7 is the highest prior- 
ity and IO is the lowest. These inputs are nega- 
tive edge catching; that is, an interrupt signal is 
recognized by the interrupt circuitry in the 
HEX-29 CPU when the line goes LOW. These 


R7-RO 
(DMA 
Requests) 


Q7-Q0 
(DMA 
Acknowledge) 


NRQ 
(No DMA 
Request) 


DDMA 
(Disable DMA) 


HALT 


FETCH 
(Fetch 
Instruction) 


RESET 


OSC 
(Oscillator) 


lines should be driven by open collector outputs 
so that multiple devices can interrupt on the 
same priority level. 


Inputs to HEX-29 CPU. R7-RO are the eight 
DMA request inputs. R7 is the highest prior- 
ity, RO is the lowest. These lines are active- 
LOW; i.e., a LOW level requests DMA time. 


Outputs, not three-state. Q7-Q0 are the eight 
DMA acknowledge lines that reply to the cor- 
responding DMA request lines (R7-RO). A reply 
to the highest requesting priority is acknowl- 
edged by a LOW level on the corresponding 
acknowledge line. Only one of these lines will 
be LOW at any given time; 1.e., the highest 
priority request gets the acknowledge. 


Output, not three-state. NRQ is LOW when no 
DMA requests (R7-RO) are being received. This 
is used primarily as a signal to dynamic memory 
refresh controllers that a refresh may be per- 
formed on any ‘free DMA’ cycle. 


Input to HEX-29 CPU. When DDMA 1s pulled 
LOW, no DMA requests are acknowledged. Es- 
sentially this line ts just the highest priority DMA 
request line — except there is no corresponding 
acknowledge signal. This signal is normally re- 
served for dynamic memory refresh controllers. 
If the refresh interval is about to expire and some 
locations have not yet been refreshed, this line 
can be pulled LOW to disable all other DMA 
devices and assure adequate time to refresh the 
remaining locations. Note that NRQ is not LOW 
when DDMA is active (LOW). The DDMA line 
should be driven by open collector outputs. 





Input to HEX-29 CPU. When pulled LOW, 
the HALT input will cause the processor to ter- 
minate program execution at the conclusion of 
the current instruction. At this time the bus will 
become continuously available for DMA as all 
three-state outputs of the HEX-29 CPU will turn 
off and BA will go active (LOW). This line can be 
held LOW indefinitely. When released, the pro- 
cessor will continue program execution. This line 
should be driven by open collector outputs. 





Output, not three-state. This signal is LOW only 
on memory read cycles when an instruction is 
being fetched from system memory. Otherwise 
this signal is normally not used except during 
system development and debugging for single 
instruction execution. 


Input to HEX-29 CPU. This is the signal 
from which system reset (CLR) is derived. Nor- 
mally this input is simply grounded with a 
pushbutton or keyswitch to reset the HEX-29 


system. 


Output, not three-state. This ts the crystal con- 
trolled master oscillator from which the system 
clock is derived. The period of this oscillator is 
normally 40ns. (25MHz). 


System Timing 


In any microprogrammed system which must interface to a 
number of external devices (as a CPU must), it is critical that 
considerable forethought be given to the methods of inter-device 
communication. It is quite common to design and build devices 
that operate with very high degrees of reliability — only to find that 
overall system reliability is inadequate when the various devices 
are interfaced. 


One of the utmost goals in designing the HEX-29 CPU was to 
develop an extremely reliable, easy to use, system bus definition. 
Simplicity and reliability go hand in hand and this is reflected in the 
HEX-29 system bus. Perhaps the single most important decision 
in this regard was to define that all memory and 1/O device 
accesses by the processor or DMA devices would share one set 
of timing rules. In other words, one set of timing specifications 
applies to any kind of access of any device by any other device. 
Some systems have different timing requirements for all sorts of 
reasons; a few examples are listed here. 


1. Memory read timing is more critical (shorter) if the memory 
being fetched is an instruction. 

2. Variations exist in the set-up and hold times required on 
read memory vs. write memory cycles. 

3. Memory devices and I/O devices use some different signals 
and timing specifications. 

4. DMA devices are required to meet a different set of timing 
requirements than the processor. 

5. Interrupt processing routines violate the normal memory 
access techniques. 


Special cases carry special problems and should be avoided like 
the plague. It is always best and easiest to have all devices and 
situations share one set of control signals and one set of timing 
relationships. Another good practice put into effect on the HEX-29 
CPU is the exclusive use of active-LOW bus signals. This is 
important in many respects. First, bipolar logic IC’s can sink (pull 
LOW) far more current than they can source. Thus any noise 
spikes need to carry far more energy to force the signal into an 
invalid level. Secondly, all signals that three-state (turn-off) will be 
pulled-up (float) to the inactive level. Furthermore, this scheme 
tends to reduce the power required by bus signal drivers and 
therefore reduce heat dissipation. 


Physical design is also important to system reliability. It is wise to 
use four layer PC cards with GND and Vcc planes as the internal 
layers, as do all of the HEX-29 system cards. An additional 
feature of the HEX-29 system bus is that all signals are interlaced 
with GND traces that return directly to the internal GND plane next 
to each bus signal. System termination should also be provided 
whenever signals must travel more than 18”. Bypass capacitors 
should abound on all system cards, one per three IC’s as a 
minimum. The HEX-29 averages one per IC. 


The timing of each machine cycle in a HEX-29 system is a 
combination of synchronous and asynchronous characteristics. 
Actually, all signals are synchronous with — or are synchronized 
by — the master oscillator from which the system clock is derived. 
Thus, despite the fact that some signals seem to be asynchron- 
ous, they are actually synchronized automatically with the system 
clock. The simplicity of this approach will become clear once the 
relationship of all signals to the system clock is explained. 


The conventions regarding the HEX-29 system clock are very 
simple. All machine cycles begin when the system clock goes 
HIGH and end simultaneously with the begining of the next 
machine cycle. The period of time that the system clock (CLK) is 
HIGH is called ¢,; (phase 1) and the period of time that it is LOW 
is called dp (phase 2) See Figure 10 for clarification. 
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During all memory and I/O accesses, the processor (or DMA 
controller) must guarantee that all address lines and control sig- 
nals are valid for at least 20ns before the end of ¢, (falling edge 
of clock). Depending upon the addressing mode, the processor 
will require a variable period of time to generate a valid address. 
Thus it is the responsibity of the processor to control the period 
of @, to meet its requirements. If no external accesses are made 
by the CPU, ¢, and ¢¢2 will last only 80ns each unless a DMA 
device takes control of the bus on that cycle and requires longer 
times. 


Similarly, d2 is controlled by the memory and I/O devices on the 
bus. If none are being accessed on a particular machine cycle, 
no control need be exercised on the system clock and ¢@p will last 
for 8Ons. However, when accessed, many memory and I/O de- 
vices more than 80ns to perform a successful read or write 
operation. They must be able to lengthen ¢2 of the system clock 
to increase the access time appropriately. This is accomplished 
with the STR bus signal. When a device is accessed that re- 
quires that @2 be longer than 80ns, it must bring STR LOW 
within 50ns of the falling edge of system clock (i.e., 50ns into 
oo). For every 40ns that STR is held LOW, the system clock is 
held in its present state for an additional 40ns. ¢2 can thus be 
extended indefinitely as required by the access time of the ad- 
dressed device. ¢; can also be extended in 40ns increments 
with the STR signal if so required by DMA devices with slow 
address generation times, or the like. 


A DMA device must gain access to the bus before it can access 
the memory location that it desires. This is very simple. It simply 
pulls its DMA request line LOW and waits for the corresponding 
DMA acknowledge signal to go LOW in reply. Then, at the be- 
ginning of the first machine cycle which finds these signals plus 
SDMA LOW, the DMA device has been granted access to the 
bus and may immediately generate the appropriate signals on 
the address, data, and control buses to accomplish the transfer. 
The memory device being accessed does not care whether it is 
the processor or a DMA device on the bus since the bus signals 
and timing used by the memory card is identical for both. Thus it 
controls #2 with the STR signal as necessary and the access is 
completed in exactly the same manner as if it had been the 
processor controlling the bus. The Boolean equation for a DMA 
device gaining access to the bus follows — and Figure 11 is a 
schematic showing how easy the implementation can be. 





Qx * Rx * SDMA « CLK = DMA device has access for the 
current cycle 
X = any DMA priority level 
The timing relationships for the HEX-29 bus are shown in 
Figure 12. 





PULSE THIS 
LINE TO 
REQUEST DMA 


+5V 


CANCEL REQUEST 


ENABLES DMA 
ADDRESS BUFFERS 
AND CONTROL 
SIGNALS ONTO 
BUS WHEN LOW 


Figure 11. DMA Bus Signals. 


INTERNAL OPERATION 
Block Diagram 


The block diagram of the HEX-29 CPU (Figure 13) shows the 
following functional modules: 


System Clock 
Microprogram Control 
Word Memory (Control Store) 
Am2901A Bit Slice ALU/Register Sets 
ALU Arithmetic Carry In Control 
Shift and Rotate Linkages 
Condition Code Control 
Am2901A Output Bus 
a. Data Output Latches 
b. Address Latches 
c. Memory Management RAM 
d. Condition Code Register 
9. Am2901A Input Bus 
. Data Bus Input Registers 
. Byte Swap Input Registers 
Microword Data Registers 
. Clear Byte/Bit Set Logic 
e. Instruction Decode PROMs 
f. Condition Code Register 
10. Interrupt Control 
11. DMA/Refresh Control 


QA Oe a ee 


a0 0 


Sections 8 and 9 are more difficult to isolate on the block diagram 
since they are the buses that connect many function modules 
together. A full detailed schematic of the HEX-29 is shown in 
Figure 14; a fold out drawing at the back of the chapter. A 
discussion of the function of each of the above modules follows. 


System Clock (Figure 15) 


All timing in the HEX-29 CPU ts controlled by the system clock. 
The positive going edge of the system clock (LOW-to-HIGH 
transition) marks the end of one machine cycle and the beginning 
of the next. All input signals to the HEX-29 CPU from the system 
bus are captured on this edge. The next microinstruction is 
clocked into the pipeline register on this edge. 
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Figure 13. System Block Diagram. 
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Normally a system clock is a simple square wave or more com- 
plex waveform with a fixed period and duty cycle. But the system 
clock of the HEX-29 CPU is microprogrammed. In other words, 
the period and duty cycle are selected by microword bits in each 
microcycle. The advantage of this approach is one of through-put 
(speed). 


In any CPU, some internal operations require longer to execute 
reliably than others. And one or more of these operations requires 
the maximum length of time to complete reliably. This is called the 
worst case delay path or “critical path”. Normally the period of 
time required to perform this “critical path” operation is chosen as 
the clock period for all instructions. 


Since the “critical path” operation may take a factor of 30% to 
100% longer to execute than typical operations, it is clear that 
much processor time is being wasted in any typical program. Two 
microword bits are used to control the HEX-29 microprogrammed 
system clock so that each microcycle lasts only as long as 
necessary for the operation being performed. An overall speed 
gain of about 30% to 40% is realized with this technique. This 
was discussed in detail in Chapter Il and Chapter III. 


The master oscillator from which the system clock is derived is a 
25MHz crystal controlled oscillator. Phase 1 (6,) of the system 
clock cycle (Figures 10 and 12) is programmed to be 2, 3, 4.or 5 
times the 40ns fundamental period of the oscillator. The duration 
of dz of the system clock is 80ns. Since main memory will rarely 
be as fast as 80ns access time, a method to allow system mem- 
ory cards to lengthen > is also provided with the STR bus 
signal. When the STR signal is low, the Am74S161 is disabled 
from counting and the state of the clock will not change until it is 
released and it counts out normally. 


The conventions regarding the system clock are very simple and 
were chosen as the easiest to interface with a variety of memory 
and 1/O devices. 


All machine cycles begin when the system clock goes HIGH. 
The period of time that the clock remains at a HIGH logic level is 
called 4. do is the period that it is LOW. During all memory 
access (and |/O since I/O is memory mapped), the processor 
guarantees that all address lines and control bus signals (R/W, 
VMA,WP, etc.) are valid and stable at least 20ns before the end 
of @,. In other words, the CPU must make all bus signals valid at 
least 20ns before do begins. 


Depending upon the addressing mode being used, the pro- 
cessor will require more or less time to make all necessary 
signals to the system bus and memory cards valid. 


For example, indexed addressing requires an arithmetic opera- 
tion from the Am2901B’s rather than logical operations or a di- 
rect pass, therefore indexed addressing is bound to take slightly 
longer than immediate, direet, or pointer addressing. 


It is for these indexed operations and some others that ¢, can 
be lengthened in 40ns increments by microword bits ST, and 
ST. So the processor controls the system clock during ¢, to 
meet its requirements. When there is no memory access, the 
minimum 80ns for , is generally more than adequate. Simple 
addressing modes require 80ns-120ns. The most complex ad- 
dressing modes can take 160ns to 190ns using the worst case 
specs for all IC’s in the address generation path. 


At the end of ¢, (the beginning of 2), the processor relin- 
quishes control of the system clock to the memory or I/O device 
that is being accessed. Since I/O is mapped into normal memory 
space, there is only one set of timing rules for both memory and 
I/O accesses. If no more than 80ns is required to properly com- 
plete the read or write operation, then 2 will last only 80ns. But 


the access time of most main memory cards will be greater than 
80ns so a way of increasing the duration of ¢o is provided with 
STR bus signals. 


If this signal (STR) is pulled LOW within the first 50ns of do, do 
will be lengthened by 40ns for every 40ns that STR is held LOW. 
Thus 2 can be extended indefinitely to match the access time 
of the device being addressed. Naturally this input should be 
driven by open collector outputs so that all cards can share the 
one STR line. 


Though the STR signal is intended to be used during ¢2 on 


memory reference cycles, it works in an identical fashion during 
o@,. This can be used to advantage by DMA controllers that 
require more than 60Ons to generate valid address, data, or con- 
trol signals on transparent DMA cycles. 


A jumper option on the microprogrammable system clock allows 

the default period of do to be increased from 80ns to 120ns on 

memory reference cycles only. This is useful in systems where 
no memory or I/O devices have access times of 80ns or less, 

and/or when more than 50ns is required to pull STR LOW to 

lengthen ¢po. Figure 16 is a table of the 4, and default 42 periods 
available with the microprogrammed clock on the HEX-29 CPU. 

Default dbo 
Default Period with VMA 
ST1 | STO | VMA | 4, Period | ¢2 Period | Option Jumpered 
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Figure 16. Microprogrammed System Clock Timing. 


Microprogram Control 


The microprogram control section (Figure 17) of the HEX-29 CPU 
performs several functions; they are: 


1. System reset and initialization 

2. Interrupt and halt control 

3. Machine level instruction to microinstruction mapping 
4. Microinstruction sequencing and microsubroutining 
5. Invalid Access Memory Management Trap 


When the system reset button or keyswitch is closed, the input to 
a one-shot is pulled LOW. When it is released, the rising edge 
triggers a 500 ys pulse. This is synchronized with the system by 
gating it through a flip-flop driver by system clock. The resulting 
signal is used to zero the outputs of the Am2909 microprocessor 
sequencer. Thus, when the one-shot times out, the microprogram 
will begin execution at microaddress 000. The microcode needed 
to initialize the system is stored at this and the following several 
microaddresses and assures the proper system start-up. 


Each time a machine level instruction is fetched, the micro- 
program control logic checks for a hardware interrupt or halt 
signal from the system bus. If either signal is active, the micropro- 
gram branches to the appropriate microinstruction address to 
execute the appropriate microcode to service the request. The 
interrupt routine will buffer user registers, switch to supervisor 
mode, and call a machine level routine through a vector table 
element as defined by the priority level of the interrupt. If the halt 
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signal is pulled LOW, the external system bus is released to DMA 
devices or refresh controllers until the halt bus line is released and 
the program continues execution. 


When an instruction has been fetched and there are no interrupts 
or halt signals pending, the microprogram must begin executing 
microinstructions at a new microaddress. This microaddress is a 
function of the machine instruction to be executed. The ‘“‘map- 
ping” of the machine level instruction into a microaddress is done 
courtesy of the Am27S29 instruction decode PROM’s. The op- 
code is placed on the PROM address lines and the microaddress 
appears at the outputs which are connected to the direct inputs to 
the Am2909’s. The Am2909’s simply pass this microaddress to 
the microword memory by executing a Branch to Address on 
direct inputs function. 


This, and all other microprogram sequencer operations are 
selected by the outputs of the microprogram branch PROM which 
is driven by microword bits. This PROM, an AM27S21 contains 
the output combinations required to execute a variety of micro- 
program control functions including microbranching, micro- 
subroutining, and two-way microbranching either unconditionally 
or upon condition code bits selected by microword bits. The 
function code for this PROM is shown in Figure 18. 


As part of the multi-user, multi-task time sharing capabilities, the 
HEX-29 CPU provides an invalid memory access trap. In this 
structure, the executive program can assign any unused page of 
user memory space as either non-existent (transparent) or as an 


Address Function 


C = 0or continue 

C = 1 or continue 

V = Oor continue 

V = 1 or continue 

N = 0 or continue 

N = 1 or continue 

Z = Oor continue 

Z = 1orcontinue 

H = 0 or continue 

H = 1 or continue 
LZ = 0 or continue 
LZ = 1 or continue 
HLT = 0 or continue 
HLT = 1 or continue 
IH = O or continue 
IH = 1 or continue 


0 
1 

2 
3 
4 
5 
6 
7 
8 
9 
A 
B 
C 
D 
E 
F 


Not used 
CALL 

Not used 
CALL N=0 
Not used 
RTS Z=1 
Not used 
RTS 

Not used 

Not used 

Not used 

Not used 

Not used 
BRMAP IH = Oor BR 
CONTINUE 





Figure 18. Microprogram Sequencer Branch Code. 


invalid access area. If any user instruction attempts to access 
memory in a page that has been assigned as an invalid access 
page, the microprogram control logic takes action. 


Before the current machine cycle completes, the next instruction 
address is forced to the highest value in the current 512-word 
microword block using the Am2909 OR inputs. At this point a 
microbranch to the invalid access trap microroutine is performed. 
The invalid access is processed just like another (highest) level of 
hardware vectored interrupt except that the current machine level 
instruction does not complete before the microprogram recog- 
nizes and acts upon the condition. 


MICROWORD MEMORY 


Any number of memory device types could have been chosen for 
the microword memory in the HEX-29 CPU. RAM has the ad- 
vantage that it is dynamically alterable, but if this feature is utilized 
much more hardware support would have been necessary and 
the overall cost increased significantly. Besides, the effect of 
writable control store can be simulated with fixed memory devices 
by microcode bank switching at much lower cost and complexity if 
the feature is desirable. For development of new microcode 
routines, RAM writable control store in the address space of 
another computer system offers many advantages. This is par- 
ticularly true if the other computer happens to support a micro- 
assembler and file management system as does the System 29.* 


Though EROM’s and EAROM’s are also viable microword mem- 
ory devices for microcode development, they are much too slow 
to make efficient use of the rest of the high speed micro- 
programmed processor in the production device. 


Fuse-link bipolar PROM’s are the only viable microword memory 
devices for production systems for a variety of reasons. They are 
very fast, (45ns maximum access on the HEX-29 CPU), smail 
(512 x 8 in 20 pins), less expensive than fast RAM, and more 
flexible than a mask ROM would be. Itis a simple matter to alter or 
extend the microprogram of commercial systems with fuse-link 
PROM microword memory. 


As mentioned, the microword memory of the HEX-29 is com- 
posed of AM27S29 512 x 8 fuse-link PROM’s and is shown in 
Figure 19. These space efficient 20 pin parts have worst case 
access times of 45ns over the commercial temperature and 
voltage range. Up to 4k of microword memory can be addressed 
by the set of three Am2909 microprogram sequencers on the 
CPU card. Space for up to 2k of microword memory PROM'’s is 
available on the HEX-29 CPU card. Though a perfectly 
adequate instruction set can be coded in less than 512-words of 
microword memory, the HEX-29 has a very extensive high level 
instruction set including 16 and 32-bit integer and 64-bit floating 
point ADD, SUB, MUL, DIV, CMP, and extensive buffering in- 
structions. In addition to the extremely complete numeric pro- 
cessing package, numerous nibble, character, byte, and word 
macroinstructions are implemented with scans, linked and un- 
linked searches, block moves, and etc. A stack processor is a 
subset of this more than complete instruction set. For all of the 
capabilities of the HEX-29 CPU, less than 1.5k of microword 
memory was required. Thus, more than 0.5k of space remains 
for future expansion by the user before a larger PC card Is 
needed (extremely unlikely). 


Connections for the microword data, address and select lines are 
available at connectors at the top of the HEX-29 CPU card. Thus, 
itis quite straightforward to support off-board microword memory. 


* System 29 is a development system for microprogrammed systems available from 
Advanced Micro Computers. 
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It is even perfectly reasonable to use an off-board writable control 
store with up to 2k of microword RAM concurrently with up to 2k of 
PROM resident on the PC card. 


If the on board PROM contains an instruction set, it is then a 
simple matter to use the off board writable control store to develop 
new microcode for the machine interactively on the one HEX-29 
system! 


The outputs of the microword memory devices are attached to the 
inputs of Am74S374 registers. These registers are called the 
pipeline registers since they allow the fetching of the next micro- 
instruction concurrently with execution of the current one. Clock- 
ing of the pipeline registers occurs on the LOW-to-HIGH transi- 
tion of the system clock. The outputs of the pipeline registers are 
the 64 microword (or pipeline) bits that control every aspect of the 
processor. 


These 64 bits can be logically grouped into several functional 
fields a follows: 


. Microword Data/Microbranch Address and Control 
. ALU Source Select 

ALU Destination Select 

. ALU Function Select 

. ALU Carry In Select 

Shift Linkage Select 

ALU A and B Specifications 

A and B Fields Select 

Enable onto ALU inputs Select 
10. Latch External Data Inputs 

11. Latch CPU Outputs 

12. Control Bus Signals 

13. Microprogrammed Clock 

14. Condition Code Controls 

15. Enable Interrupt Circuitry 

16. Memory Map Control 


OC ONAHAR WD > 


Notice that with the exception of the microword data and micro- 
branch address and control fields, no other fields are overlapped. 
This is a ‘horizontally’ structured design. Overlapping several 
fields leads to ‘vertically’ structured systems. This latter class of 
machines can save some microword memory, but only at the 
expense of through-put and increased hardware complexity. Now 
that the cost of the PROM’s has come down significantly, the 
savings accrued from using a vertically structured design ap- 
proach Is generally insignificant when compared with the overall 
system cost. 


A summary of the functions of the microword bits is shown in 
Figure 20. 


Am2901B ALU/REGISTER SETS 


The heart of the HEX-29 CPU Is the set of four Am2901B bit slice 
ALU/Register Sets depicted in Figure 21. All arithmetic and logi- 
cal operations are performed in these bipolar LSI IC’s, including 
address generation. The user accessible set of 16 registers and 
routing functions are also internal to these remarkable and ex- 
tremely versatile chips. 


The operation of these units, though very elegant and com- 
prehendible, is too lengthy to include here and the user ts referred 
to the Am2900 Family Data Book by AMD. 


Carry lookahead is accomplished by the Am2901B’s and an 
external IC, the Am2902A. Shift control is partially within the 
Am2901B’s and is supported by other external circuitry to be 
discussed later. 


A summary sheet of the Am2901B ALU functions appears on 
page 29 but should be supplemented by studying the AMD 
literature already mentioned. A good supplement is the AMD 
Schottky and Low Power Schottky Handbook. 


The A and B input fields to the Am2901B’s are multiplexed by 4 
Am74S253's in the following four ways. 


Am2901B B Inputs 


word Memory 

Upper Nibble ABL 
Lower Nibbie ABL Lower Nibble ABL 
Upper Nibble ABL Lower Nibble ABL 


ABL = A,B Latch (On data bus bits 27-20.) 


Am2901B A Inputs 


jeword Memory 
Upper Nibble ABL 


CARRY IN CONTROL 


The arithmetic carry-in (Cj) signal (Figure 22) to the Am2901B bit 
slices can be selected from four sources as follows: 
1. Logic OQ (No carry-in add instruction, borrow in subtract 
instruction.) 
2. Logic 1 (Carry-inin add instruction, no borrow in subtract.) 
3. Carry Flag (C bit in condition code register.) 
4. Q Shift Bit (Double length shifts.) 


Note that the natural state of the Carry Flag output from the 
Am2901B is 1 for carry on add, 0 for no carry on add, 1 for no 
borrow on subtract, and 0 for borrow on subtract. This convention 
has been maintained in the condition code and carry in logic. 
Some other machines operate differently with respect to this 
convention, but others do not and the HEX-29 maintains the 
faster convention for lack of a good reason to alter it. Some 
programmers will be required to remember this convention while 
others will be used to it. 


SHIFT AND ROTATE LINKAGE 


The shift and rotate linkage (Figure 23a) of the HEX-29 is com- 
posed of an Am74S253 and an Am74LS125 plus the internal shift 
control structure of the Am2901B’s. The functions that can be 
performed by this circuitry are shown in Figure 23b. 


The solid lines in Figure 23b delineate the basic shift linkages. 
The dotted lines are optional linkages which can also be enabled. 
With these linkages, all of the normal shifts and rotates can be 
performed plus a number of double word shifts including special 
shifts for high speed multiplies and divides. 


CONDITION CODE CONTROL 


The condition code register shown in Figure 24 of the HEX-29 has 
eight flags. The definitions and placement of these flags are 
defined in Figure 25. 


In addition to the very useful and fairly common C, V, N, Z flags, a 
half sign is provided for easier byte processing. The three user 
flags are not changed by any of the normal arithmetic or logical 
operations. However, they can be read by the processor and 
written by the processor with special instructions such as load 
flags, read flags, set bits in flags, clear bits in flags, invert bits in 
flags. The fact that none of the user flags is changed by any but 
this type of special routine is very significant. It means that various 
routines and program segments can pass flags back and forth 
freely without fear of modification or restriction on the instructions 
that can be executed. Reading the condition code flags into the 
processor, or branching or subroutining upon combinations of bits 
set or clear does not alter the flags. 


. VMA 
. SDMA 
. INE 


BRAO 
BRA1 

BRA2 
BRA3 
BRA4 
BRA5 
BRA6 
BRA7 
BRA8 
BRAQ 
BRAA 
BRAB 
BRCO 
BRC1 

BRC2 
BRC3 
BRC4 


Function 
Microprogram 
Branch 
Address 


Microword data to Internal 
data bus to Am2901B’s 


Microprogram 
Branch 
Control 


Latch in low nibble of data bus 
Control Bits A 
Shift and Rotate MUX B 


Am2901 Source Select Code 
Carry-In MUX Select Bit A 
Am2901 ALU Function Code 
Carry-In MUX Select Bit B 
Am2901 Destination Code 


Fetch Instruction this cycle 


Am2901 ‘B’ field register specification 


Am2901 ‘A’ field register specification 


A, B fields MUX Select Bit 0 (A, B fields on Am2901) 
A, B fields MUX Select Bit 1 (A, B fields on Am2901) 
Microprogrammed system clock stretch bit 0 
Microprogrammed system clock stretch bit 1 

Latch Data In — Both Swapped and Unswapped 

Latch N, Z, H flags — MUX Select 

Latch C, V flags — MUX Select 

Latch Condition Codes — U2, U1, U0, H, Z, N, V, C MUX Select 
Read Condition Codes onto internal bus, low byte 
Select Microword bits 15-0 to internal bus, else branch code 
Data In Low Byte Enable onto internal bus 

Data In High Byte Enable onto internal bus 

Swapped Data In Low Byte Enabled onto internal bus 
Swapped Data In High Byte Enabled onto internal bus 
Clear Low Byte on internal bus — Bit Set Enable HIGH 
Clear High Byte on internal bus — Bit Set Enable LOW 
Load Memory Map — Write into Memory Map RAM 
Read Memory Map — Enable Memory Map to data bus 
Latch Address — Enable Transparent Address Latch 
Bus Available — Busses available for DMA this cycle 
REad/Write Memory (Write if low) 

Valid Memory Address (Read or Write) this cycle 

Sync. DMA — Active cycle before bus is available 
Interrupt Logic Enable 


Figure 20A. Microword Bits. 
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Figure 20B. Am2909 Microprogram Branch Control, 


Bits 12-16. 
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Figure 20C. Condition Code Manipulation, 
Bits 45-47. 
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Figure 20E. Am2901 Destination Codes, 
Bits 28-30. 


| | Right | Left | 
|o | MUL | RCL_ 
ROR [ROL 















Figure 20G. Shift & Rotate Control, 
Bits 18-19. 


Figure 20 (Cont.). 





Figure 20F. Am2901 A, B Field Selects, 
Bits 40-41. 
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Figure 20H. Microprogrammed System 
Clock Stretch, Bits 42-43. 
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Figure 22. 
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Carry Flag (Arithmetic) 


Overflow (Two’s Complement Arithmetic) 


Negative (Most significant bit = 1) 
for N = 1) 
Zero — (Z = 1 if bits in 
word = 0) 
Half Sign — (H = 1 if bit 7-1) 
(Note: Not half carry) 


User Flag 0 — (Can be set/reset/ 
Tested by user only) 


User Flag 1 — (Can be set/reset/ 
tested by user only) 
User Flag 2 — (Can be set/reset/ 
tested by user only) 
MPR-674 





Figure 25. 


Eight condition code operations provide all the useful operations 
needed for complete flexibility. They are shown in Figure 26a and 
26b In two different formats. Note that the codes are grouped into 
three categories; arithmetic (C and V), logical/arithmetic (N, Z, H) 
and user (Us, U;, Uo). 


These eight conditions include all the necessary and desirable 
features such as updating only the shift carry bit and the ability to 
do operations that read, operate on, and reload the condition 
code register all in one machine cycle (160ns). Also, a feature of 
immense importance where microcoded floating point or fixed 
point math is concerned is the ability to update flags on a cycle by 
cycle basis! An unusual feature. 


Carry/ Negative/Zero/ User Flags 
Overflow C, V Half N, Z, H - U2, U1, UO 
7 


Shift Bit C,V V No Change No Change 




































*6 Shift Bit C,V V Load From Bus Load From Bus 
Load From Bus No Change No Change 
Load From Bus Load From Bus Load From Bus 
No Change No Change No Change 
No Change Update No Change 
Update No Change No Change 


Update Update No Change 


*Less useful than other codes but perfectly legal. 
Figure 26A. 


[Name ua iui {uo} [Zz [N{v | c 


Shift MSb or LSb into C NC} NC;}NC|NC|NC} NC; NC} S 
B;}B;BsNC| S 
NC|NC|NC/] B/] B 
BiB:iB|;BiB 
NC |} NC} NC} NC; NC 
w]e] pw | NC]NC 
NC}NC|NC] w |] pw 
emlelele de 


*Shift into C — 
Bus Load Rest 

jz = updated, NC = unchanged, B = loaded from internal bus, 

S = Shift Bit 

















*Bus Load C & V Flags 
Bus Load All Flags 
No Changes 
Update N, Z, H Flags 

Update C and V Flags 

Update C, V, N, Z, H 


Figure 26B. 


Am2901B OUTPUT BUS 


Being a highly structured, modular device, the HEX-29 CPU is 
very bus oriented. The output bus of the Am2901B’s generate the 
addresses and data to the rest of the system devices as well as 
some internal function. The four logical units on this bus (shown in 
Figure 27) are: 


1. Address Out Latches — (System Address bus) 

2. Data Out Latches—(System data bus) 

3. Memory Map/Latches— (Memory Management Features) 
4. Condition Code MUX — (For updating flags from processor) 


Any memory reference requires that an address be valid on the 
system address bus. The source of this address is generally one 
of the Am2901B internal registers or modifications thereof from 
previous fetch cycles (such as indexed addressing). 


On a write cycle, data must be placed on the system data bus. 
This is accomplished in the same manner as address generation 
except that a different microword bit is used to activate the data 
latches. 


In a multi-user/multi-task/timesharing environment, it is desirable 
to have a powerful memory management scheme. The HEX-29 
CPU implements this via a flexible memory mapping system 
where the upper four bits of the 16-bit address generated by the 
Am2901B’s are ‘mapped’ into seven address bits and a write 
protect bit. Invalid access traps and one Megabyte address space 
are integral features of this system. The loading of this MAP RAM 
(2Am29701’s) is also accomplished via the Am2901B output bus. 


Another important characteristic of the HEX-29 CPU is its ability 
to read, write, test and operate upon the eight condition code flags 
in the byte form. All eight flags can be written to by the Am2901B’s 
at one time, in one microcycle. This is very useful for many flag 
operations and Is absolutely necessary for efficient updating of 
the user flags for interroutine parameter and condition passing. 


The logic of these bussed systems is quite simple. A separate 
microword bit or bit field is used to cause each of these logical 
units on the bus to accept the data bus. Therefore, simple micro- 
programming techniques are applicable to this busing approach. 


Am2901B INPUT BUS 


Much of the power and modularity of the HEX-29 design is due to 
the highly structured bus approach on the Am2901B Data Inputs. 
The logical units that can drive this bus (Figure 28) are listed 
below: 


1. Data Input Registers 

2. Swap Input Registers 

3. Microword Data Registers 

4. Clear Upper Byte/Clear Lower Byte/Bit Op Logic 
5. Condition Code Register 


Data input from the system bus is captured in the data input 
registers and the swap input registers. The data input registers 
bring the upper and lower bytes of the data bus to the corre- 
sponding bytes in the Am2901B cascade while the swap input 
registers switch the upper byte of the data bus to the lower byte on 
the Am2901B cascade and the lower byte of the data bus to the 
upper byte of the Am2901B cascade. 


Additionally, logic to set all bits in the upper or lower byte to zeros, 
(clear upper byte and clear lower byte), allow selecting arithmetic 
or logical zeros in either byte field. If the bit set option is enabled, 
all bits are pulled low except the one selected by the hexadecimal 
value in the low nibble of the nibble latch from an instruction or 
other data source. 


All eight condition code bits can be enabled onto the low byte if 
desired. All flags can thus be sampled by the Am2901B’s at once. 
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Data from microword memory from three-state registers in par- 
allel with the pipeline register can be enabled onto the upper 
and lower bytes for direct loading of the Am2901B’s from 
microprograms. 


In the absence of any device being enabled onto a particular byte 
on this bus, it will be pulled up into a logic 1 state. This can be 
useful for masking in logical operations and filling or biasing in 
arithmetic operations. 


An important factor in the flexibility of this approach on the 
HEX-29 is that the upper and lower bytes of the data In registers, 
swap in registers, and the clear upper/lower byte logic are sepa- 
rately enabled. Also, the condition code register only drives the 
lower byte and the pull-up feature will operate on either byte 
individually. Thus the upper and lower bytes can be individually 
driven on a ‘mix and match’ basis from several sources. 


The versatility so generated allows numerous fast processing 
modes. See Table 5 for a list of all of the possible combinations of 
high and low byte inputs to the 2901B’s. 


TABLE 5. 


Into Upper Byte Into Lower Byte 
of Am2901’s of Am2901’s 


. Microword memory bits P15-P8 | Microword memory bits P7-PO 
Bit set value (upper byte) Bit set value (lower byte) 





Upper byte — data bus 
Upper byte — data bus 
. Upper byte — data bus 
. Upper byte — data bus 
Upper byte — data bus 
. Lower byte — data bus 
. Lower byte — data bus 


Lower byte — data bus 
Upper byte — data bus 
Clear lower byte 

All high generator 
Condition code register 
Lower byte — data bus 
Upper byte — data bus 


ODNANAWN=O0 







Lower byte — data bus 
10. Lower byte — data bus 
11 Lower byte — data bus 
12. All high generator 


Clear lower byte 

All high generator 
Condition code register 
Lower byte — data bus 

















13. All high generator 
14 All high generator 
15 All high generator 
16 All high generator 
17 Clear upper byte 

18. Clear upper byte 
19. Clear upper byte 
. Clear upper byte 
. Clear upper byte 


Upper byte — data bus 
Clear lower byte 

All high generator 
Condition code register 
Lower byte — data bus 
Upper byte — data bus 
All high generator 
Condition code register 
Clear lower byte 








*Note: Interestingly enough this is the only case in the entire table 
that the hardware CANNOT generate on the bus, but IS the 
ONLY one of these codes that CAN be generated by the AMD 
2901B slices! (How convenient!) 


Examples of uses for some of these modes include: 


—- 


Clearing upper byte for 8-bit index offset 

2. Fast bit set/clear/test/invert operations 

3. Set upper byte high to AND lower byte with upper byte 
change 

4. Clear upper byte to AND off upper byte and operate lower 

5. Upper byte of data bus to lower byte for all byte ops on upper 

byte 


6. Load defined values from microcode for tamper-proof con- 
stants, vectors, etc. 

7. Normal data input or address input without swap or modifi- 
cation. 

8. Clear upper byte and data in low-bite immediate ops, etc. 


INTERRUPT CONTROL 


The powerful maskable priority vectored interrupt system (Figure 
29) of the HEX-29 is a direct derivative of the incredible Am2914 
bipolar LSI interrupt control IC. This circuit is so well integrated 
that it uses only one microword bit and requires very little support 
circuitry. The general set of operations that can be executed by 
the Am2914 is shown below. For more detailed information on this 
chip see the Am2900 Family Data Book. 


Enable Request 

. Load Mask Register 

. Disable Request 

. Clear Mask Register 

. Bit Set Mask Register 

. Bit Clear Mask Register 

Load Status Register 

Set Mask Register 

REad Mask Register 

Read Status Register 

Read Vector 

Clear Interrupts Last Vector Read. 
Clear Interrupts via M Register 
Clear Interrupts via M Bus 
Clear all Interrupts 

Master Clear 


SPP OP OO OO OOo mM 4 


Flow charts of the actions taken in microcode by the HEX-29 CPU 
are shown in Figure 30 and Figure 31. 


DMA CONTROL 

The DMA structure is quite straightforward. There are eight ac- 
tive-LOW DMA request lines and eight corresponding DMA 
acknowledge lines. The highest priority requesting a DMA cycle 
at the beginning of the microcycle before DMA will be allowed 
gets an acknowledge signal that lasts up until the DMA cycle — at 
least. 


If no devices are requesting DMA, the NRQ (no request) bus 
signal goes LOW. This is an excellent opportunity for dynamic 
RAM circuitry to refresh sequential rows on each DMA cycle that 
NRQ is LOW. 


Another input signal DDMA, will override all priorities and not 
acknowledge any level of DMA request. This could be used by 
dynamic RAM refresh circuitry when it must be permitted to 
refresh itself soon or chance losing data. 


Many schemes of DMA handling can be accomplished with this 
simple and uncomplicated priority controlled system. An 
Am74S374 captures the DMA requests (Figure 32) on a cycle by 
cycle basis. An Am2913 prioritizes these requests and acknow- 
ledges the highest level request with a three-bit binary code. An 
Am74S138 expands this to the eight bits of DMA acknowledge 
that correspond to the eight input bits. The Am2913 supplies the 
NRQ bus signal and provides for the DDMA bus signal. 
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INTERRUPT SEQUENCE 


EXECUTE INSTRUCTION 


ANY 

INTERRUPT 

ACTIVE 
? 






HEX 29 








YES 





CURRENTLY 

IN SUPERVISOR 

MODE 
? 


NO 








YES 


PUSH USER MMO (IN TEMP.) ONTO STACK PUSH MMO ONTO STACK 


INTERRUPT VECTOR 


EXECUTION OF INTERRUPT 
ROUTINE IN SUPERVISOR. 
SUPERVISOR MAY RE-ENABLE 
INTERRUPTS IF DESIRED. AT 
END OF INTERRUPT PROCESSING 


ROUTINE, THE SUPERVISOR 
RETURNS CONTROL TO USER 
PROGRAM WITH AN 
RTI INSTRUCTION. 





MPR-678 


Figure 30. 


304 


RETURN FROM INTERRUPT 


FETCH RTI OP CODE 


CURRENTLY 
IN SUPERVISOR 


MODE 
? 


ILLEGAL ATTEMPT OF 
USER TO EXECUTE A 
PROTECTED INSTRUCTION. 


NO OPERATION ENABLE INTERRUPTS 


CONTINUE USER POP PC FROM STACK 
PROGRAM 


POP CC FROM STACK 


POP MMO INTO TEMP. 


DOES TEMP 
= 0? (i.e., RETURNING 
TO SUPERVISOR?) 


STORE SP AT PHYSICAL 00409 


EXECUTE NEXT 


LOAD SP FROM PHYSICAL 0040A SUPERVISOR INSTRUCTION 


TEMP -~ MMO 


EXECUTE NEXT USER 
INSTRUCTION 
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Figure 31. 
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SYSTEM BUS INTERFACE EXAMPLE 
HEX-64KBS STATIC MEMORY CARD 


It was possible to design the system bus to be very simple to work 
with because the HEX-29 is a microprogrammed device. The 
following section discusses an implementation of a 64k byte static 
memory card for the HEX-29 system bus using Am9124 memory 
ICs. The purpose is to show that designing cards that interface 
with the HEX-29 system bus is relatively easy. Note that a design 
for I/O devices would be similar to this implementation since I/O 
devices are memory mapped and share exactly the same set of 
bus signals and timing requirements. 


Starting from the left hand of the schematic shown in Figure 33, 
we find that the low 13 bits of the address bus and the four control 
bus signals (CLK, VMA, R/W, and WP) are buffered from the 
system bus by two Am74S240 ICs and three sections of an 
Am74S244. These are inverting and non-inverting buffers re- 
spectively, and offer extremely high current drive (64mA sink 
current) and very high speed (~4 to 6ns) with only very light bus 
loading (400A low level). 


Ten of the address lines buffered by these ICs then drive the 
address lines of half of the memory array through series type 
termination resistors. These resistors (~33 ohms) serve to pre- 
vent undershooting zero volts by more than the permissible 0.5V 
on negative edge transitions of the address lines. This type of 
termination has the advantage that it does not draw current from 
the driver ICs; it is highly recommended over split termination for 
memory arrays where current loading is negligible, but capacitive 
loading is significant. Note that to further reduce these capacitive 
loading effects, the address lines of only half of the memory array 
are driven by one set of buffers. (Find the second set of 
Am74S240 address buffers at the far right of the schematic.) 


The remaining 3 address lines that were buffered by the 
Am74S240s drive the A, B, and C inputs of (4) Am74S138 
one-of-eight decoders. These ICs develop the 32 1k word chip 
selects that enable the appropriate Am9124 memory ICs for read 
and write operations when they are addressed. 


Of course only one of the Am74S138 ICs should be enabled when 
the board is addressed. This is a function of the higher address 
lines, A18-A13. Since each Am74S138 is able to select 1k word 
blocks of memory, each Am74S138 should be addressable on 8k 
word boundaries. Decoding the upper address lines (A18-A13) to 
match selectable 8k boundary addresses is accomplished with 
four Am25LS2521 8-bit equal to comparators, one for each 
Am74S138. 


The DIP switches on the right hand side of each Am25LS2521 
define the conditions under which the corresponding Am74S138 
will be selected. When the eight inputs on the left hand side of 
these chips correspond to the values set on the DIP switch on the 


right, the Am74S138 is enabled. Note that the VMA bus signal 
(Valid Memory Access) must be LOW to enable the 
Am25LS2521. Also note that each 8k word bank can be uncondi- 
tionally removed from the system memory space by leaving the 
lowest DIP switch open. Thus the board may be filled in 8k word 
increments if desired. 


The Am74S138 ICs are also enabled by the system clock via the 
CLK signal. Therefore, memory chip selects can only occur dur- 
ing the time that the system clock is LOW (called 2). The impor- 
tance of this will be discussed shortly. Another signal that must be 
valid for these ICs to be enabled is the DiS Signal. Whenever the 
R/W signal is LOW (indicating a write) and WP (write protect) is 
HIGH (protect the memory), then the DIS signal is brought LOW. 
This disables the Am74S138’s and blocks the selecting of any 
memory ICs, thereby write protecting all on-board memory. 


Above the memory array on the schematic are the data bus 
buffers, one set for each half. Again, this is done to reduce 
Capacitive loading, this time on the data lines. Am74S373 octal 
tri-state latches are used for all eight of these data buffers. The 
enabie inputs are driven by the inversion of the system clock bus 
signal so that they are transparent during all of do, which is when 
the data is transferred. The appropriate Am74S373 latches are 
turned on (OE LOW) during read and write signals so that the data 
is buffered in the proper direction. 


The Am26S02 one-shot is used to stretch 2 of the system clock 
to meet the access time of the memory. Without this signal, do 
would last only 80ns and the access time specifications of the 
Am9124 memory ICs would not be met. The Am26S02 is acti- 
vated whenever memory ICs on the board are addressed when 
the system clock enters ¢2 (negative edge). Once fired, the 
duration of $9 is stretched by 40ns for every 40ns that the STR 
bus signal is held LOW. Since the Am9124 EPC memory de- 
vices have an access time of 200ns worst case, #2 must be 
stretched by 120ns. 


Summary 


As can be seen, the HEX-29 16-bit design represents a simple, 
straightforward design approach to building a high-performance 
16-bit processor. This design takes advantage of many of the 
features of the Am2901 and Am2909. The instruction set shown 
in this application note is intended to be representative of the 
more common types of instructions to be executed on a machine 
of this class. In addition, microcode could be developed to exe- 
cute a great many additional instructions as well as other classes 
of instruction such as entire floating point package. This design 
utilizes microprogram control throughout, and is a good demon- 
stration of parallel microprogramming in a most straightforward 
application. 


AMD wishes to thank Mr. Mike Simmons and Mr. Lee McDonald 
of HEX for their work on this invited paper as a part of this 
application note series. 


APPENDIX 


HEX-29 Microcode 


This appendix contains 256 words of HEX-29 microcode. The first 
part is a definition file which defines the HEX-29 hardware struc- 
ture for the AMDASM™ assembler. The various inputs to the 
Am2901 are defined via equates while all other microword fields 
are literally defined. The second part is the assembly file which 
symbolically, via terms defined in the definition phase, constructs 
each microword. Each microword begins with an optional label 
(such as RESET:). Next is the Am2909 branch control field, 
followed by all of the remaining control fields. This structure gives 
the appearance of a conventional assembler, i.e., LABEL, OP- 
ERATION, OPERANDS. A microinstruction which has no 


Am2909 branch control specified, such as microwords 3 and 4, 
uses microword bits 0-15 (which includes the branch control field) 
to place immediate data directly on the internal Am2901 bus. The 
Am2909 is then forced to “CONTINUE” by the “LIN” field. LIN, 
besides Latching IN the data on the Am2901 bus, disables the 
microprogram branch control register output, causing the “CON- 
TINUE” function to be selected in the branch control PROM (see 
Figure 20B). 


These 256 microwords represent a reasonable subset of the 
HEX-29 standard instructions, i.e., branch, conditional branch, 
data moves (MDV), and, or, add, sub, etc. 
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308 


CRD €4 


Sd SS Ox SU ES bat ee ee we ae ee we we 


’ 
3 AM2921 


’ 

AQ: 
AB: 
ZQ: 
ZB: 
ZA: 
DA: 
DQ: 
LZ: 


’ 
3 AM2S01 


y 

ADD: 
SUBR: 
SUBS: 
OR: 
AND: 
NOT: 
EXOR: 
EXNCR: 


’ 
3 AM2S¢1 


’ 
QREG: 
NOOP: 
RAMA: 
RAMF: 
RAMQD: 
RAMD: 
RAMQU: 
RAMU: 


; 
AM2921 


, 
NOINE: 
INE: 


’ 
NCSDMA: 
SDMA: 


’ 
VMA: 
NOVMA: 


’ 
wRITE: 
READ: 


’ 
BA: 
NOBA: 


o] 
NOLAD: 
LAD: 


’ 
RMM: 
NORMM: 


? 
LMM: 
NOLMM: 


’ 

CLRHL: 
CLRLO: 
CLRHI. 
NOCLB: 


, 

SWPHL: 
SWPLO: 
SWPHI: 
NOSWP: 


! 

DINHL: 
DINLO: 
DINHI: 
NODIN: 


NOSDA: 
SDA: 


’ 
RCC: 
NORCC: 


’ 

NOLCC: 
LCC: 
LCV: 
LCCLCV: 
LNZ: 
LNZCC: 
LNZCV: 
LNZCCV: 


; 
NCLDI: 
LDI: 


s 
NOSTR: 
STR1: 
STR2: 
STR3: 


, 
MWAMWB: 
RSRD: 
RSRS: 
RDRD: 


MZSO1 REGISTER EQUATES 


HEX-29 DEFINITION PHASE (PHASE 1) 


EQU Ha@ 

EQU HA1 

EQ H#2 

EQU Rad 

EQU H#4 

EQU He5 

EQU Ha6 

EQU He? 

EQU H#8 

EQU H#9 

EQU HAA 

EQU HaB 

EQU HAC 

EQU HaD 

EQU HAE 

EQU HAF 

SOURCE (R $) OPERAND EQUATES 
EQU QHe 

EQU Qe#i 

EQU Q#H2 

EQU QH#3 

EQU QH4 

EQU QHs 

EQU QH6 

EQU Qa? 

ALU FUNCTION (R FUNCTION S) EQUATES 
EQU Qae 3R+S 
EQU Qi 3S-R 
EQU Q#2 3R-S 
EQU QH3 3R S 
EQU QH4 3k S 
EQU Q#5 3k S 
EQU QHE 3k S 
EQU Qa? 3k S 


DESTINATION CONTROL EQUATES 


EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 
EQU 


DEF 


DEF 
DEF 


DEF 
DEF 


DEF 
DEF 


DEF 
DEF 


DEF 
DEF 


DEF 
DEF 


DEF 
DEF 


DEF 
DEF 


DEF 
DEF 
DEF 
DEF 


DEF 
DEF 
DEF 
DEF 


DEF 
DEF 
DEF 
DEI 


DEF 
DEF 


DEF 
DEF 


DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 


DEF 
DEF 


DEF 
DEF 
DEF 
DEF 


DEF 
DEF 
DEF 
DEF 


Quo 
Qa1 
Q#e 
Qas 
QH4 
Qad 
QH6 
Q#? 


000 


wee we We we we we owe 


24X,4VX,4VX,1X,3VX,1X,3VX,1X,3VX, 2X 


BHO, 63K 
BH1,635X 


1.,B40,62x 
1X,B#1,€2X 


2%, B#0,61% 
2X,BH1,€1X 


3X , BA, 60X 
ZX, BH1 ,68X 


4X, Bad, 59X 
4X ,B#1,59X 


oX , B#2,58X 
5X, B#1,58X 


EX, BAG, 57% 
EX ,BH1,57X 


7X, BHO ,S6X 
?X,BH1, SEX 


QX , B#O0, 54X 
@X,B#01,54X 
GX, B#18, 54X 
aX, B#11,54X 


10X, BAD, 52X 
1X ,B#81,52X 
1@X,B#10 , 52X 
10X ,B#11,52X 


12X , BAGS , 58x 
12X, BHS1, Sex 
12X ,B#10,50X 
12X,B#11,50X 


14X, BHO, 49X 
14X,B#1,49X% 


15X ,Ba®,4EX 
15X,B#1,48X 


16X , Q#0, 45X 
16K ,Q#1,45% 
16X, #2, 45X 
16X ,Q#3,45X 
16X ,Q#4,45X 
16X,Q#5,45X 
16X ,Q#6,45X 
16X,Q#7, 45x 


19X, BAG, 44X 
19X ,B#1,44X 


20X , BHOD ,42X 
20X ,B#1,42X 
20X ,BH#18 ,42X 
20X,B#11,42x 


22X,BHO2 , 40X 
22X,B#01,40X 
22k ,BH1G , 40X 
22X ,B#11,40X 


3INE = INTERRUPT LOGIC ENABLE 
sSDMA = SYNC DMA 

sVALID MEMORY ACCESS THIS CYCLE 
;READ/NOT-WRITE MEMORY 

3BUS AVAILABLE THIS CYCLE 

;LAD = ENAB TRANSPARENT ADDR LATCH 
;READ MEMORY MAP 

3LOAD MEMORY MAP 


;CLEAR HI/LO BYTE INTERNAL BUS 
3SWAP DATA HI/LO INTERNAL BUS 
;DATA IN HI/LO INTERNAL BUS 


;SDA = SELECT MICROWORD BITS 15-0 
; TO INTERNAL BUS 


;READ CC TO INTERNAL BUS 


3CC MUX SELECT 


" 


3 LDI LATCH DATA IN, SWAPPED & UNSWAPPED 


CLOCK STRETCH CONTROL 


3STR 


32901 A,B MUX SELECT 


3 
NOFTCH: DEF 


32X, BO, 31X ;FETCH = 
FETCH: DEF Z2X,B#1,3512X 
’ 
NCCIB: DEF 36X , BHO, 27X 7CIB = 
CIB: DFF 36X,BH#1,27% 
? 
NOCIA: DEF 40X,B#@,23% >CIA = 
CIA: DEF 40X ,B#0,22X 
’ 
RCLMUL: DEF 44X ,B#00 ,18X ‘SHIFT & 
ROLROR: DEF 44X,B#01,18X 
DRLASR: DEF 44X,B#10,18X 
LSLLSR: DEF 44X ,B#11,18X 
, 
NOLIN: DEF 46X, BHO, 17X 
LIN: DEF 46X,B#1,17X 
, 
BRCO: DEF 47X, BH#OO200,12X 
BRC1: DEF 47X , B#60001,12X 
BRVO: DEF 47X,B#00010,12x 
BRV1: DEF 47X ,B#00@11,12X 
BRNO: DEF 47%, BHOZ100,12X 
BRN1: DEF 47X ,B#00101,12X 
BRZ@: DEF 47X, B#08110,12% 
BRZ1: DEF 47X ,B#@6111,12X 
BRHS@: DEF 47X ,B#01090,12X 
BRHS1: DEF 47X ,B#@1001,12X 
BRLZ@: DEF 47X,B#01010,12X 
BRLZ1: DEF 47X ,B#01011,12X 
BRHLT@: DEF 47X, B#81190,12X 
BRELT1: DEF 47X,B#01181,12X 
BRIHO: DEF 47X ,B#01110,12X 
BRIH1: DEF 47X,B#@1111,12X 
BRANCH: DEF 47X,B#10000,12Xx 
CALL: DEF 47X ,B#10018,12Xx 
CNS: DEF 47X,B#10100,12X 
RTSZ21: DEF 47X,B#10110,12X 
RET: DEF 47X ,B#11900,12X 
BRIHMAP: DEF 47X,B#11110,12X 
CONTNUE: DEF 47X,B#11111,12X 
’ 
DATA: DEF 48X , 16VH#O200 


= 
ow 


HEX-29 MICROPROGRAM ASSEMBLY PHASE 
(FIRST 256 MICROPROGRAM WORDS) 


ESET: CONTNUE AM2901 = R@ ’ 
NOINE & SDMA 

BA & NOLAD 
NOSWP & NODIN 
NOLDI & NOSTR 


NOCIA & RCLMUL 


NN NAN Bt we we ee ee ee ee we txt we we 


CONTNUE AM2901 R®@ ’ 
NOINE & NOSDMA 
BA & NOLAD 
NOSWP & NODIN 
NOLDI & NOSTR 


NOCIA & RCLMUL 


~nnNnN NS 


AM29@1 
NOINE 
NOBA 
NOSWP 
NOLDI 
NOCIA 


CONTNUE RO ’ 
& NOSDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


AM2981 = R® ’ 
INE & SDMA 

NOBA & NOLAD 
NOSWP & NODIN 
NOLDI & NOSTR 
NOCIA & RCLMUL 


~Nw SNS 


~~ RS 


AM2981 R15 ’ 
INE & SDMA 

BA & NOLAD 
NOSWP & NODIN 
NOLDI & NOSTR 
NOCIA 6& RCLMUL 


~~ SSS 


BRANCH IFETCH & AM29@1 RO ’ 
NOINE & NOSDMA 
BA & NOLAD 
NOSWP & NODIN 
NOLDI & NOSTR 


NOCIA & RCLMUL 


MMMMOMM COMM MM MMAMMMM SCHOMMMM MMM PHM MOY HMOO| 


~~ SS 


ORG H#G888 


IFETCH: AM29@01 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


R15 ‘ 
& SDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 
AM29@1 R15 ’ 
KCINE & NOSDMA 
BA & LAD 
NOSWP & DINHL 
NOLDI & NOSTR 
CIA & RCLMUL 


a 
42) 
3 
w 


BRIHMAP IHR 


el oc 
M MOM MM OMM NM HM 


ORG Hageec 


2] 
toad 
+ 


BRANCH INSTR AM2991 
NOINE 
NOBA 
NOSWP 
LDI 


NOCIA 


R15 ’ 
& SDMA 

& LAD 
& DINHI 
& NOSTR 
& RCLMUL 


Ee] 
bed 
' 


BRANCH INSTR AM29@1 R15 ’ 
NOINE & SDMA 


KOBA & LAD 


~N ~~ NS 
POM MM Yr MMM 


FETCH INSTRUCTION THIS CYCLE 


CARRY-IN MUX SELECT BIT B 


CARRY-IN MUX SELECT BIT A 


ROTATE MUX CTL BITS 


;LATCH-IN LOW NIBBLE OF DATA BUS 


32989 NEXT INSTRUCTION CONTROL 


sMICROWORD DATA 


R@ » QkFG = , ADD » AQ 
& NOVMA & READ 

& NORMM & LMM & NOCLB 

& NOSDA §& NORCC §& LCCLCY 
& MWAMWB & NOFTCF & NOCIB 

& NOLIN 


RO » QREG , ADD » AQ 
& NOVMA & READ 

& NORMM & LMM & NOCIB 
& NOSDA & NORCC & LCCLCV 
& MWAMWB & NOFTCH & NOCIB 
& NOLIN 


R@ » NOOP , 

& NOVMA & WRITE 
& NORMM & NOLMM & NOCLB 

& NOSDA & NORCC & LCCLCV 
& MWAMWB & NOCFTCER & NOCIB 

& NOLIN 


AND + DZ 


RO » NOOP  , OR » DZ 
& NOVMA & WRITE 

& NORMM & LMM & NOCLB 
& SDA & NORCC & LCCLCV 
& MWAMWB & NOFTCH & NOCIB 
& LIN & DATA H#@@OD 


R15 » RAMF , OR » D2 
& NOVMA & READ 


& NORMM & LMM & NOCLB 


& SDA & NORCC & LCCLCV 
& MWAMWB & NOFTCH & NOCIB 
& LIN & DATA H#@208 
R@ » QREG , ADD » AQ 
& NOVMA & READ 

& NORMM 6& LMM & NOCLB 


& NOSDA & NORCC & LCCLCV 
& MWAMWB & NOFTCH & NOCIB 
& NOLIN 


R15 » RAMA , ADD » ZA 
& VMA & READ 

& NORMM & LMM & NOCLB 
& NOSDA §& NORCC §& LCCLCV 
& MWAMWE & FETCH & NOCIB 
& LIN & DATA H#F OOS 
R15 » RAMF , ADD » 2B 


& NOVMA & READ 
& NORMM & LMM & NOCLB 
&§ NOSDA & NORCC & LCCLCV 


& MWAMWB & NOFTCH & NOCIB 
& NOLIN 

R15 » RAMF , ADD » DA 
& VMA  & READ 


& NOKMM & LMM & CLRLO 

& NOSDA & NORCC & LCCLCY 
& MWAMWB & FETCH & NOCIB 

& NOLIN 


R15 » RAMP, 
& VMA & READ 
& NCRMM & LMM & NOCLB 


ADD » DA 








~~N™ 


SR+: 


~N NNN 


~N NNN. td ~N NNN ~NNNNN ~NNNN OS ~N NNN. ot ~n~N NS 
oe) 
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| 


=z 
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oe t + 

ee oe 
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~~ SS 


CONTINUE 


BRANCK 


CONTNUE 


BRANCH 


BFC 


BRANCH 


BRCO 


BRANCH 


BRC1 


BRANCH 


BRC1 


BRANCH 


BRVG 


BRANCH 


BRVO 


BRANCH 


BRV1 


BRANCH 


IFETCH 


IFETCH 


IFETCH 


INSTR 


IFETCH 


INSTR 


IFETCH 


INSTR 


IFETCH 


INSTR 


I FETCH 


INSTR 


IFETCH 


INSTR 


IFETCH 


INSTR 
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NOSWP 
LDI 
NOCIA 


AM29901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM 2901 
NOINE 
NOBA 
NOSWP 
NOLDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
NOLDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 
AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA~ 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2921 
NOINE 
FA 
NOSWP 
NOLDI 
NOCIA 


AM2921 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AMz9@1 
NOINE 
BA 
NCSWP 
NOLDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


4M2981 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 


& DINEI 


& NOSDA & NORCC 


& LCCLCV 


& NOSTR & MWAMWB & FETCH & NOCIB 


& RCLMUL 


R14 ’ 
& NOSDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 


Ri5 ’ 
& NOSDMA 
& NOLAD 
& DINEI 
& NOSTR 
& RCLMUL 


R14 ’ 
& NOSDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 


R15 ’ 
NOS DMA 
NOLAD 
DINHI 
NOSTR 
RCLMUL 


POO Mm HM 


R15, 
& NOSDMA 
& NOLAD 
& DINHI 
& NOSTR 
& RCLMUL 

Ris), 
& SDMAy 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 


Ris ’ 
& NOSDMA 
& NOLAD 
& DINHI 
& NOSTR 
& RCLMUL 


R15 ’ 
& SDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 


R15 ’ 
& NOSDMA 
& NOLAD 
& DINHI 
& NOSTR 
& RCLMUL 


R15 ’ 
& SDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 


R15 ? 
& NOSDMA 
& NOLAD 
& DINHI 
& NOSTR 
& RCLMUL 


R15 ’ 
& SDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 


R15 ’ 
& NCSDMA 
& NOLAD 
& DINFI 
& NOSTR 
& RCLMUL 


R15 ’ 
& SDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 


RiS ’ 
& NOSDMA 
& NOLAD 
& DINKI 
& NOSTR 
& RCLMUL 


R15 ’ 
& SDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 


R15 ’ 
& NOSDMA 
& NOLAD 
& DINHI 
& NOSTR 
& RCLMUL 


R15 ; 
& SDMA 
& LAD 
& NODIN 


& NOLIN 
R14 » RAMF , SUBR 
& NOVMA & READ 


& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 
& NOLIN 


R15 » RAMA =, ADD 
& VMA & WRITE 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 
& NOLIN 


R14 » RAMF , SUBR 
& NOVMA & READ 

& NORMM & LMM & 
& NCSDA & NORCC & 


& MWAMWR & NOFTCH & 
& NOLIN 


R15 » RAMA = , ADD 
& \VMA & WRITE 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 
& NOLIN 

R15 + QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 
& NOLIN 

R15 » RAMF , ADD 
& \VMA & READ 

& NORMM & LMM & 
& ACSDA & NCRCC & 
& MWAMWP & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA 6& READ 

& NORMM & LMM & 
& NOSDA & NCRCC & 
& MWAMWB & NCFICH & 
& NOLIN 

R15 » RAMF , ADD 
& VMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSTA & NORCC & 
& MWAMWB S NOFTCP & 
& NOLIN 


R15 » RAMF , ADD 
& VMA  §& READ 

& NORMM & LMM & 
& NOSDA 6& NORCC & 
& MWAMWB & FETCH & 
& NOLIN 


R15 » QRES , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA 6& NORCC & 


& MWAMWB & NOFTCH & 
& NOLIN 


R15 » RAMF , ADD 
& VMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
S MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM 6 
& NOSDA & NORCO & 
& MWAMWB & NCFTCH & 
& NOLIN 


R15 + RAMF , ADD 
& \VMA & READ 

& NORMM §& LMM & 
& NOSDA & NORCC & 
& MWAMWB & FETCH 6& 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 
& NOLIN 


R15 » RAMF , ADD 
& VMA & READ 
& NORMM & LMM & 


& NOSDA & NORCC & 
& MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 
& NOLIN 


R15 » RAMF , ADD 
& WMA & READ 

& NORMM & LMM & 
& NOSDA 6& NORCC & 


>» ZA 


NOCLE 
LCCLCV 
NOCIB 


» DA 


CLRLO 
LCCLCV 
NOCIB 


» ZA 


NOCLB 
LCCLCV 
NOCIB 


» DA 


NOCLB 
LCCLCV 
NOCIB 


CLRLO 
LCCLCV 
NOCIB 


NOCLB 
LCCLCV 
NOCIB 


NOCLB 
LCCLCV 
NOCIBb 


NOCLB 
LCCLCV 
NOCIB 


CLRLO 
LCCLCV 
NOCIB 


» 2Q 


NOCLB 
LCCLCV 
NOCIB 


» DA 


NOCLB 
LCCLCV 
NOCIB 


» 2Q 


NOCLB 
LCCLCV 
NOCIB 


» DA 
CLRLO 


LCCLCV 
NOCIB 


NOCLB 
LCCLCV 
NOCIB 


» DA 
NOCIB 


LCCLOV 
NOCIB 


» 2Q 
NOCLB 


LCCLCV 
NOCIB 


» DA 
CLRLO 
LCCLCV 
NOCIB 

+ 2Q 


NOCLB 
LCCLCYV 


4 < < 
+ ' 
. oo 


~~  t ~N NS 
Zz 
ra 
+ 
. 


~NN NON ~onNNNS 
z= 
2 
i] 
oe 


Z+3 
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N 
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SSS ~~ NSS 
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nn 


IFETCH 


BRV1 


BRANCH INSTR 


BRN@ 


IFETCE 


BRANCH 


INSTR 


BRN® 


IFETCH 


BRANCH 


INSTR 


BRN1 


IFETCH 


BRANCH INSTR 


BRN1 


IFETCH 


BRANCH INSTR 


BRZO IFETCH 
BRANCH INSTR 
BRZO 0208 


BRANCH INSTR 


BRZ1 


8888 


BRANCH INSTR 


BR2Z1 


3888 


BRANCH INSTR 


MOMAMA MMOMMMM MMMAMM MOM OHMM OM 


PMPPOMM POMMAMM MAMOMMM MOMM HMM MAM MMM 


MMMM MM MMM MMM PMMOMM SPOMMM NM PMO MMM PMMOMM HM 


MOM MM MOM MMM MM MMM MH 


LDI 
NOCIA 


AM2921 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
BA 
NOSWP 
KOLDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@61 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2961 
NOINE 
BA 
NOSWP 
NOLDI 
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AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
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AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2961 
NOINE 
NOBA 
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AM2901 
NOINE 
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NOSWP 
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NOCIA 


AM2961 
NOINE 
NOBA 
NOSwP 
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AM29@1 
NOINE 
BA 
NOSWP 
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NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
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AMz991 
NOINE 
BA 
NOSWP 
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AM2961 
NOINE 
NOBA 


& NOSTR 
& RCLMUL 


R15 ’ 
& NOSDMA 
& NOLAD 
& DINE! 
& NOSTR 
& RCLMUL 


R15 ’ 
& SDMA 
& Lad 
& NODIN 
& NOSTR 
& RCLMUL 


R15 ’ 
& NOSDMA 
& NOLAD 
& DINHI 
& NOSTR 
& RCLMUL 


R15 ’ 
& SDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 
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& DINEI 
& NOSTR 
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R15 ’ 
& SDMA 
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& NOSTR 
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& SDMA 
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& NOSTR 
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& NOSTR 
& RCLMUL 
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& SDMA 
& LAD 
& NODIN 
& NOSTR 
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& NOSTR 
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& LAD 


& MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 


& NOLIN 
R15 » RAMF_ , ADD 
& VMA & READ 


& NORMM & LMM & 
& NOSDA 6& NORCC & 
& MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM 6 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 


& NOLIN 
R15 » RAMF , ADD 
& VMA §& READ 


& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 


& NOLIN 
R15 » RAMF , ADD 
& VMA & READ 


& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & FETCH 6& 
& NOLIN 


R15 » QREG , 
& NOVMA & READ 
& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 
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& NOLIN 
R15 » RAMF , ADD 
& VMA & READ 


& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 


& NOLIN 
R15 » RAMF , ADD 
& VMA & READ 


& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& kOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 


& NOLIN 
R15 » RAMF , ADD 
& VMA & READ 


& NORMM & LMM & 
& NOSTA & NORCC & 
& MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 
& NOLIN 


R15 » RAMF , ADD 
& VMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & FETCH & 
& NOLIN 

R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 


& NOLIN 
R15 » RAMF , ADD 
& VMA & READ 


& NORMM & LMM é 
& NOSDA & NORCC & 
& MWAMWB & FETCH & 
& NOLIN 


R15 » QREG , ADD 
& NOVMA & READ 

& NORMM & LMM & 
& NOSDA & NORCC & 
& MWAMWB & NOFTCH & 


& NOLIN 
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LDI 
NOCIA 


AM29901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
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AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOS@P 
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AM2961 
NOINE 
BA 
NOSWP 
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AM2901 
NOINE 
NOBA 
NOSWP 
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AM2921 
NOINE 
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NOSWP 
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AM29@1 
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NOBA 
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AM2901 
NOINE 
BA 
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AM2991 
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& NOSTR & MWAMWB & FETCH 6& 
& RCLMUL & NOLIN 

R15 » R15 » QREG , ADD 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & 
& DINRI & NOSDA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R15 » R15 » RAMF , ADD 
& SDMA & VMA & READ 

& LAD & NORMM & LMM & 
& NODIN & NOSDA 6& NORCC 6 
& NOSTR & MWAMWB & FETCR & 
& RCLMUL & NOLIN 

R15 » R15 » QREG , ADD 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM 6& LMM & 
& DINHI 6& NOSDA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R15 » R15 » RAMF , ADD 
& SDMA & VMA- & READ 

& LAD & NORMM & LMM & 
& NODIN & NOSDA & NORCC & 
& NOSTR & MWAMWB & FETCH 6& 
& RCLMUL & NOLIN 

R15 » R15 » QREG , ADD 
& NOSDMA & NOVMA & READ 

& NOIAD & NORMM & LMM & 
& DINHI & NOSDA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R15 » R15 » RAMF , ADD 
& SDMA & VMA- & READ 

& LAD & NORMM & LMM & 
& NODIN & NOSDA & NORCC & 
& NOSTR & MWAMWB & FETCH 6 
& RCLMUL & NOLIN 

R15 » R15 » QREG , ADD 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & 
& DINHI & NOSDA §& NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R15 » R15 » RAMF , ADD 
& SDMA & VMA & READ 

& LAD & NORMM & LMM & 
& NODIN & NOSDA & NORCC & 
& NOSTR & MWAMWB & FETCH & 
& KCLMUL & NOLIN 

RQ » RQ » RAMF , SUBR 
& SDMA & NOVMA & READ 
& NOLAD & NORMM & LMM é 
& NODIN 6& NOSDA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R15 » R15 » QREG , ADD 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & 
& DINHI & NOSDA & NORCC 6& 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R15 » RIS » RAMF , OR 
& SDMA & VMA & READ 
& LAD & NORMM & LMM & 
& NODIk & NOSDA & NORCC & 
& NOSTR & MWAMWB & FETCH & 
& RCLMUL & NOLIN 

RQ » k9 » RAMF , SUBR 
& SDMA & NOVMA & READ 
& NOLAD S& NORMM & LMM & 
& NODIN & NOSDA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NCLIK 

R15 » R15 » QREG , ADD 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMY & LMM & 
& DINHI & NOSDA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R1S » R15 » RAMF , OR 
& SDMA & VMA & READ 

& LAD & NORMM & LMM & 
& NODIN & NOSDA & NORCC & 
& NOSTR & MWAMWB & FETCH & 
& RCLMUL & NOLIN 

RO » RB » QREG , AND 
& NOSDMA & NOVMA & READ 

& NOLAD §& NORMM & LMM & 
& NODIN & NOSDA & NORCC & 
& NOSTR & RSRS & NOFTCH & 
& RCLMUL & NOLIN 

Ro » RO » NOOP , SUBR 
& SDMA & VMA- & READ 
& NOLAD & NORMM & LMM & 
& DINHI §& NOSDA & NORCC & 
& NOSTR & MWAMWB & FETCH & 
& RCLMUL & NCLIN 

R15 » R15 » QREG , ADD 
& SDMA & NOVMA & READ 
& NOLAD & NORMM 6& LMM & 
& DINHI & NOSTA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R15 » R15 » QREG , ADD 
& SDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & 
& DINHI & NOSDA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
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NOCIA & RCLMUL & NOLIN 


AM2901 
NOINE 
BA 
NOSWP 
NCLDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM29@1 
NOINE 
BA 
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AM2961 
NOINE 
BA 
NOSWP 
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AM2901 
NOINE 
BA 
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BA 
NOSWP 
NOLDI 
NOCIA 
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AM2901 
NOINE 
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AM2901 
NOINE 
BA 
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AM2961 
NOINE 
BA 
NOSWP 
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AM2901 
NOINE 
NOBA 
NOSWP 
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AM2901 
NOINE 
BA 
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AM2901 
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R10 » RB » RAMF , AND » DA 
§&§ NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLB 
& NODIN & SDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & LIN & DATA H#OOFF 
R15 » R15 » RAMA, ADD » ZA 
& SDMA & VMA_~ & READ 
& LAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R8 » RS » NOOP , SUBS , DA 
& SDMA §& NOVMA & READ 
& NOLAD § NORMM & LMM & CLRLO 
& NODIN & NOSDA 6& NORCC & LCC 

& KOSTR & MWAMWE & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R8 » R8 » NOOP , SUBR , DA 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & CLRLO 
& DINHI §& NOSDA & NORCC & LCC 

& NOSTR & MWAMWB & NCFTCH & NOCIB 
& RCLMUL & NOLIRK 

RO » RO » NOOP , ADD » AQ 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & NCCLB 
& NODIN & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R15 » R15 » RAMF , ADD » 2Q 
& SDMA & VMA & READ 

& LAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LCCLCY 
&'NOSTR & MWAMWB & FETCH 6& NOCIB 
& RCLMUL & NOLIN 

RO » RB » RAMU , ADD » ZA 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LNZCCV 
& NOSTR & RSRD & NOFTCH & NOCIB 
& LSLISR & NOLIN 

RO » RB » NOOP , ADD » 2B 
& SDMA & VMA~ & READ 
& NOLAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LCV 

& NOSTR & RSRD & FETCH & NOCIB 
& RCLMUL & NOLIN 

RO » RB » RAMD , ADD » 2A 
& NOSTMA & NOVMA & READ 

& NOLAD §& NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LNZCCV 
& NOSTR & RSRD & NOFTCH & NOCIB 
& DRLASR & NOLIN 

RO » RO » RAMU , ADD » ZA 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLB 
& NODIN 6& NOSDA & NORCC & LNZCCV 
& NOSTR & RSRD §& NOFTCH & NOCIB 
& ROLROR & NOLIN 

R@ » RO » RAMD , ADD » 2A 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM 6& LMM & NOCLB 
& NODIN & NOSDA & NORCC & LNZCCV 
& NOSTR & RSRD 6& NOFTCH & NOCIB 
& ROLROR & NOLIN 

R@ » RO » RAMU , ADD » ZA 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LNZCCV 
& NOSTR & RSRD 6& NOFTCH & NOCIB 
& RCLMUL & NOLIN 

Ro » RO » RAMF , OR » DZ 
& SDMA & VMA~ & READ 
& NOLAD & NORMM 6& LMM & CLRLO 
& NODIN & NOSDA & RCC & LCCLCYV 
& NOSTR & RDRD 6& FETCH & NOCIB 
& RCLMUL & NOLIN 

RO » RO » RAMD , ADD » 2A 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC §& LNZCCV 
& NOSTR & RSRD & NOFTCH & NOCIB 
& LSLLSR & NOLIN 

RO » RB » QREG , AND » DA 
& SDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLB 
& NODIN & SDA & NORCC & LCCLCV 
& NOSTR & RDRD & NOFTCH & NOCIB 
& RCLMUL & LIN & DATA H#SCOF 
Ro » RB » NOOP , ADD » 2B 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM §& LMM & NOCLB 
& NODIN & NOSDA & NORCC & LCV 
& NOSTR & RSRS & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R@ » RB » RAMU , ADD » ZA 
& SDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LNZCCYV 
& NOSTR & RSRS & NOFTCH & NOCIB 
& LSLLISR & NOLIN 

RO » RO » QREG , SUBR , 2Q 
& SDMA 6& NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC §& LCCLCYV 
& NOSTR & MWAMWE & NOFTCH & NOCIB 
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NOCIA & RCLMUL & NOLIN 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
BA 
NOSWP 
NCLDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM 2981 
NOINE 
BA 

NOSWP 
NOLDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
CIA 


AM2921 
NOINF 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2961 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2S21 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AMZ901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2961 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM29901 
NOINE 
BA 
NOSWP 
NOLDI 


Ro ’ 
& SDMA 

& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 

& NOLAD 
& DINHI 
& NOSTR 
& RCLMUL 


R8 ’ 
S SDMA 

& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 
& NOLALT 
& NODIN 
& NOSTR 
& RCLMUL 


Ro ’ 
& NOSDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ; 
& NOSDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& LSLLSR 


RO ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& ROLROR 


R8 ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RG , 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RQ ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


k@ ’ 
& NOSDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 

& NOLAD 
*& NODIN 
& NOSTR 
& LSLLSR 


RO ’ 
& SDMA 

& NOLAD 
& NODIN 
& NOSTR 
& ROLROR 


RO ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


Ro ’ 
& NOSDMA 
& NOLAD 
& NODIN 
& NOSTR 


R@ » QREG , OR » DZ 
& NOVMA & READ 


& NORMM & LMM & NOCLB 
& SDA & worcc & LCCLCYV 
& MWAMWB & NOFTCH & NOCIB 
& LIN & DATA H#OGOF 
RO » QREG , AND » DQ 
& NOVMA & READ 

& NORMM & LMM & CLRLO 
& NOSDA & NORCC & LCCLCV 


& MWAMWB & NOFTCH S NOCIB 


& NCLIN 

R8 » RAMF , AND » DZ 
& NOVMA & READ 

& NORMM & LMM & NOCLB 
& NOSDA §& NORCC & LCCLCV 


S MWAMWB & NOFTCH & NOCIB 
& NOLIN 


RS » QREG , AND » DA 
& NOVMA & READ 


& NORMM & LMM & NOCLB 
& SDA & NORCC & LCCLCV 
& MWAMWB & NOFTCH & NOCIB 
& LIN & DATA E#OO1F 
R@ » NOOP , ADD » ZA 
& NOVMA & READ 

& NORMM & LMM & NOCLB 
& NOSDA & NORCC & LCV 

& RDRD & NCFTCH & NOCIB 
& NOLIN 

RO » NOOP , ADD » AQ 
& NOVMA & READ 

& NORMM & LMM & NOCLB 
& NOSDA & NORCC & LCCLCV 
& MWAMWB & NOFTCH & NOCIB 
& NOLIN 

RO » RAMU , ADD > ZA 
& NOVMA & READ 

& NORMM & LMM & NOCLB 
& NOSDA & NORCC & LNZCCV 
& RSRS & NOFTCH & NOCIB 
& NOLIN 

RO » RAMU , ADD » ZA 


& NOVMA & READ 

& NORMM & LMM & NOCLB 

& NOSDA & NORCGC & LNZCCYV 
& RDRD & NOFTCH & NOCIB 

& NOLIN 

R8 » RAMF , ADD » ZB 
& NOVMA & READ 

& NORMM & LMM & NOCLB 

& NOSDA & NORCC & LCCLCV 


& MWAMWB & NOFTCH & NOCIB 
& NOLIN 


RO » QREG , SUBR , 2Q 
& NOVMA & READ 

& NORMM & LMM & NOCLB 
& NOSDA & NORCC & LCCLCV 
& MWAMWB & NOFTCH & NOCIB 
& NOLIN 


RO » QREG , AND » DA 
& NOVMA & READ 

& NORMM & LMM & NOCLB 
& SDA & NORCC & LCCLCV 
& MWAMWB & NOFTCH & NOCIB 


& NOLIN & DATA H#061F 

RB » NOOP , ADD » ZA 
& NOVMA & READ 

& NORMM 6& LMM & NOCLB 

& NOSDA & NORCC & LCV 

& RDRD & NOFTCH & NOCIB 

& NOLIN 

RO » RAMU , ADD » 2A 
& NOVMA & READ 

& NORMM & LMM & NOCLB 

& NOSDA & NORCC & LNZCCYV 
& RSRS & NOFTCH & NOCIB 

& NOLIN 

RO » RAMU , ADD » ZA 
& NOVMA & READ 

& NORMM & LMM & NOCLB 

& NOSDA & NORCC & LNZCCYV 
& RDRD & NOFTCH & NOCIB 

& NOLIN 

RO » QREG , SUBR , ZQ 
& NOVMA & READ 

& NORMM & LMM & NOCLB 

& NOSDA & NORCC & LCCLCV 


& MWAMWB & NOFTCH & NOCIB 


& NOLIN 

R@ » NOOP , ADD + ZB 
& VMA & READ 

& NORMM & LMM & NOCLB 
& NOSDA & NORCC & LNZ 

& RDRD & FETCH & NOCIB 
& NOLIN 

RO » QREG , ADD » ZA 
& NOVMA & READ 

& NORMM & LMM & NOCLB 
& NOSDA & NORCC & LCCLCV 
& RDRD & NOFTCH & NOCIB 
& NOLIN 

Rd » RAMF , ADD + ZA 


& NOVMA 
& NORMM 
& NOSDA 
& RSRD 


& READ 

& LMM & NOCLB 
& NORCC & LCCLCY 
& NOFTCH & NOCIB 


VNRR: 


= ~ mR ESS 


OVRR: 


DDRR: 


DCRR: 


n SRR RNR RR BR RS 


UBRR: 


BCRR: 


NDRR: 


ORRR: 


ORRR: 


SSS PS NS NOR NG RRR RN RRR RRR RRR RRM RRS 
ro 
wa 
= 


_ 
2 
Q 
w 
= 
° 


ECRR: 


OMRR: 


NNN NNO ANN NN OD RRR SRS 


se] 
Q 
w 
~ 


WPRR: 


ANNAN NNN NN SE 


SUBRR: 


~NN NNR SN 


BRANCH INSTR 


BRANCH INSTR 


BRANCH INSTR 


BRANCH INSTR 


INSTR 


BRANCH 


BRANCH INSTR 


BRANCH INSTR 


BRANCH INSTR 


BRANCH INSTR 


BRANCH INSTR 


INSTR 


BRANCH 


BRANCH INSTR 


INSTR 


BRANCH 


INSTR 


BRANCH 


BRANCH INSTR 


CONTNUE 


BRANCH INSTR 


BRANCH INSTR 


AM2981 
NOINE 
NOBA 
NCSWP 
LDI 
NOCIA 
AM2961 
NOINE 
NOBA 
NOS#P 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
NCCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29¢1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2921 
NOINE 
NOBA 
SWPHL 
LDI 
NOCIA 


MPOMMMM MMMMMM MMMMMM MMMMAM MAMMAMM MMMMMM MMM MMM PMMMMM MMMMMM MMMMMM MMMMMM MMMM MM MMMMMM MMMMMM MOMMMM MMM MMMOMMOMM M 


RB ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 
ko ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


k@ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 

& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO : 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


Ro ‘ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


Ro ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ' 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO 

& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ' 
& SDMA 

& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO , 
& SDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& NOSDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& SDMA 

& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


& AM2991 R® 


& NOINE 
& NOBA 

& NOSWP 
& LDI 

& CIA 


SDMA 
NOLAD 
NODIN 
NOSTR 
RCLMUL 


‘al 40 48 4 4) 


NOCIA & RCLMUL & NOLIN 


Ro ’ 
& VMA 
& NORMM 
& NOSDA 
& RSkS 
& NOLIN 
RO ’ 
& VMA 
& NORMM 
& NOSDA 
& RSED 
& NOLIN 


1 RO ’ 


& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RQ ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


Ro ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& KSRL 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& kKSRD 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRL 
& NOLIN 


» RO , 


& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RO ’ 
& NOVMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


RO ’ 
& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


» RO 


& VMA 
& NORMM 
& NOSDA 
& RSRD 
& NOLIN 


» RAMF 


RAMF , ADD » 2Q 
& READ 
& LMM & NOCLB 
& norcc & LCCLCYV 
& FETCH & NOCIB 
RAMF , ADD » 2A 
& READ 
& LMM & NOCLB 
& nORCC & LCCLCV 
& FETCH & NOCIB 


RAMF , ADD 
& READ 

& LMM & NOCLB 
& NORCC & LCV 

& FETCH & NOCIB 


» ZA 


RAMF = , ADD » AB 
& READ 
& LMM & NOCLB 
& NORCC & NOLCC 
& FETCH & NOCIB 


RAMF , ADD » AB 
& READ 
& LMM & NOCLB 


& NORCC & NOLCC 
& FETCH & CIB 


RAMF , SUBk , AB 
& READ 
& LMM & NOCLB 
& NORCC & NOLCC 
& FETCH & NOCIB 


RAMF , SUBR , AB 
& READ 
& LMM & NOCLB 
& NORCC & NOLCC 
& FETCH & CIB 


RAMF , AND » AB 
& READ 

& LMM & NOCLB 
& NORCC & LCV 

& FETCH & NOCIB 


RAMF , OR » AB 
& KEAD 
& LMM & NOCLB 
& NORCC & LCV 
& FETCH & NOCIB 


RAMF , EXOR , AB 
& READ 
& LMM & NOCLB 
& NORCC & LCV 


& FETCH & NOCIB 


NOOP , SUBR , AB 
& READ 
& LMM & NOCLB 


& norcC & NOLCC 
& FETCH & NOCIB 


RAMF , ADD » ZA 
& READ 

& LMM & NOCLB 
& NORCC & LCV 


& FETCH & NOCIB 


RAMF , SUBR , ZA 
& READ 

& LMM & NOCLB 
& NORCC & LCV 


& FETCH & NOCIB 


RAMF , EXNOR , ZA 


& READ 
& LMM & NOCLB 
& NORCC & LCV 


& FETCH & NOCIB 


RAMF , SUBS , ZA 
& READ 
& LMM & NOCLB 
& nORCC & NOLCC 
& FETCH & NOCIB 


NOOP , ADD » ZA 
& WRITE 

& LMM & NOCLB 
& NORCC & LCCLCV 
& NOFTCH & NOCIB 


RAMF , OR » DZ 
& READ 

& LMM & NOCLB 
& NORCC & LCV 


& FETCH & NOCIB 


» SUBS , AB 
& READ 

& LMM & NOCLB 

& NORCC & NOLCC 


& FETCH & NOCIB 


311 


312 


MOVAX: 


~N NS 


VNXA: 


NNR NNR ORR 


OVXA: 


DDXA: 


NN RRR OR RRR ORR NR RR 
i“ 
z 
oe 


DCXA: 


~ NNN RR NKR 


UBXA: 


NNN NNO? AN ~AN ON 


BCXA: 


~NN NNR ORS 


NDXA: 


~N NN NOD ~~ NNN 


~~N NNN 
oO 
i] 
bl 
>= 
oe 


~ NNN Nt 


~~N NNN 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH INSTR 


CONTNUE 


BRANCH INSTR 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCP IFETCEH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


MPOMMAM MOMAMM MMOMMMM MMMAMM MMMOMMMH MADMAMH MMMMOM MAMAOM MOMMMM MMMM MM MMHMAMM MMMMMM MMMMAMM MMMAMAMM MMOMMMM MMBMMMM MMM HAMM POMAMM HO 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
NOLDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSwWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM29@61 
NOINE 
NOBA 
NOSwP 
LDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2991 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2961 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


Ril » R11 + NOOP , ADD » DA 
& NOSDMA & NOVMA & READ 

& LAD & NORMM & LMM & CLRLO 
& DINHI & NOSDA & NORCC & LCCLCV 
& NOSTR §& MWAMWB & NOFTCH S NOCIB 
& RCLMUL & NOLIN 

R10 » R10 » RAMA =, OR » ZA 
& NOSDMA & VMA & WRITE 

& NOLAD & NORMM & LMM & KOCLEB 
& NODIN & NOSDA & NORCC & LCV 

& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R11 » R11 » NOOP. , ADD » DA 
& NOSDMA & VMA- & READ 

& LAD & NORMM & LMM & CLRLO 
& DINHI & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & KCFTCH & NOCIB 
& RCLMUL & NOLIN 

R15 » R16 » RAMA , OR » DZ 
& SDMA & VMA & READ 

& LAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & LCCLCYV 
& NOSTR & MWAMWB & FETCR & NOCIB 
& RCLMUL & NOLIN 

R11 » R11 » NOOP , ADD » DA 
& NOSDMA & VMA & READ 

& LAD & NORMM & LMM & CLRLO 
& DINHI 6& NOSDA & NORCC & LCCLCV 
& NOSTR §S& MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R15 » R18 » RAMA, OR » DZ 
& SDMA & VMA & READ 

& LaD & NORMM & LMM & NOCLB 
& DINHL §& NOSDA §& NORCC & LCV 
& NOSTR & MWAMWB & FETCH §& NOCIB 
& RCLMUL & NOLIN 

R11 » Ril » NOOP , ADD » DA 
& SDMA & \VMA & READ 

& LAD & NORMM & LMM & CLKLO 
& DINHI §& NOSDA & NORCC & LCCLCY 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R10 » R10 » RAMF , ADD » DA 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & NOCLB 
S DINHL 6& NOSDA & NORCC & NOLCC 
& NOSTR & MWAMWE & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

Ril » R11 » NOOP , ADD » Da 
& SDMA & MA & READ 

& LAD & NORMM & LMM & CIRLO 
& DINHI & NOSDA §& NORCC & LCCLCV 
& NOSTR & MWAMWH & NOFTCH & NOCIE 
& RCLMUL & NOLIN 

R18 » RID » RAMF , ADD » DA 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLB 
& DINHL §S NOSDA & NORCC & NOLCC 
& NOSTR & MWAMWR & NOFTCH & CIB 

& RCLMUL & NOLIN 

R11 » R11 » NOOP , ADD » DA 
& SDMA & VMA & READ 
& LaD & NORMM & LMM & CLRLO 
& DINHI & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWE & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R18 » RIO » RAMF , SUBR , DA 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & NOLCC 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R11 » R11 » NOOP , ADD » DA 
& SDMA & MA & READ 

& LaD & NORMM & LMM & CLRLO 
& DINHI & NOSDA & NORCC & LCCLCYV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R18 » R18 » RAMF , SUBR , DA 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & NOCLB 
& DINHL 6& NOSDA & NORCC & NOLCC 
& NOSTR & MWAMWB & NOFTCH & CIB 
& RCLMUL & NOLIN 

R11 » R11 » NOOP_ , ADD » DA 
& SDMA & VMA & READ 

& LaD & NORMM & LMM & CLRLO 
& DINHI 6& NOSDA & NORCC & LCCLCYV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R18 » R11 » RAMF , AND » DA 
& NOSDMA & NOVMA & READ 

& NOLAD 6& NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & LCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

Ril » R11 » NOOP_ , ADD » DA 
& SDMA & VMA & READ 
& LaD & NORMM & LMM & CLRLO 
& DINHI & NOSDA & NORCC §& LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R16 » R1@ » RAMF , OR » DA 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & AOCIB 
& DINHL & NOSDA S NCRCC & LCV 
& NOSTR & MWAMwR & NOCFTCH & NOCIE 
& RCLMUL & NOLIN 


XORXA: 


~~ NS 


MPXA: 


~NRNRNRNRO RRRRS 


~~ NNN 


~ NN NN 


QP = 3 Q 
~< os ~ 
ee oe oe 


SSSI LS ~n~N NS 
9 
~ 
oe 


SUBXA: 


~~AN RD NR NS 


OUT: 


~NNNN ES ~N NNN ~~NN NS 


MVNM+R: 


VNRM-: 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


BRANCH IFETCH 


CONTNUE 


QP MMAMMM AMAMMM MAMMMOMM MMMM MM MMMMMM MOHMMMOMM MOMMMM MMMM MM MAMOMM AMAMAAMAMAMAMMMH AMAMAOMOM MAMMMMH MMMAAMOM MOMMM MH SOMMAMM MAM HMMM MMMAMM 


AMz29€1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2961 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
NOLDI 
CIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
NOLDI 
NOCIA 
AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
NOLDI 
NOCIA 


AM2991 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
NOLDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
NOBA 
SWPHL 
NOLDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
CIA 


AM2901 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
NCBA 
NOSWP 
NOLDI 
NOCIA 


AM29@1 


R11  , R121 , NOOP , ADD , CA 
& SDMA  & VMA & READ 
& LAD & NORMM & LMM & CIRIO 
& DINHI & NOSTA S& NORCC 6 LCCLCV 
& NOSTR & MWAMWE & NOFTCEK & NOCIB 
& RCLMUL & NOLIN 

R1@ , R10 =, RAMP , EXCR , DA 
& NOSDMA & NOVMA & READ 
& NOLAD & NORMM & LMM & NOCLP 
& DINHL & NOSDA & NORCC & LCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R11  , R12. =, NOOP , ADD, DA 
& SDMA & VMA & READ 
& LAD & NORMM & LMM & CLRLO 
& DINHI & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R1@  , R1@ , NOOP , SURBK , DA 
& NOSDMA & NOVMA & READ 
& NOLAD §& NORMM & LMM & NOCLE 
& DINHL 6& NOSTA & NORCC §& NOLCC 
& NOSTR & MWAMWB & NOFTCE & NOCIB 
& RCLMUL & NOLIN 

R11 , R11 , NOOP , ADD, DA 
& NOSDMA & VMA & READ 
& LAD & NORMM & LMM & CLPLO 
& DINBI 6& NOSDA & NORCC & LCCICV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R8 » ke , KAMF , ADD, D2 
& NOSDMA & VMA_ & WRITE 
& NOLAD & NORMM & LMM & NOCLB 
& DINBL & NOSDA & NORCC & LCV 
& NOSTR 6& MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R11 , R11 , NOOP , ADD , DA 
& NOSDMA & VMA_ & READ 

& LAD & NORMM & LMM & CLRLO 
& DINHI & NOSDA 6& NORCC & LCCLCV 
& NOSTR & MWAMPR & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RS » RB , RAMF , SUBS , DZ 
& NOSDMA & YMA & WRITE 

& NOLAD & NORMM & LMM & NOCLB 
& DINHL S NOSDA 6& NORCC 6& LCV 

& NOSTR & MWAMWR & NOFPTCH & NOCIB 
& RCLMUL & NOLIN 

Rit  , Rit , NOOP , ADD, DA 
& NOSDMA & VMA  §& READ 
& LAD & NORMM & LMM & CLRiO 
& DINHI §& NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R8 » R8 , RAMF , EXNOR , DZ 
& NOSDMA & VMA  & WRITE 
& NOLAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & LCV 
& NOSTR 6& MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R11 , R11 , NOCP , ADD , DA 
& NOSDMA & VMA & READ 
& LaD & NORMM & LMM & CLRLO 
& DINHI 6& NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RS » R8 » RAMF , SUBR , D2 
& NOSDMA & VMA & WRITE 
& NOLAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & NOLCC 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

Rit  , R11 , NOOP , ADD , DA 
& NOSDMA & VMA & READ 

& LAD & NORMM & LMM & CLRLO 
& DINHI & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMOB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RS » RB » RAMF , OR , D2 
& NOSDMA & YMA & WRITE 

& NOLAD & NORMM & LMM & NOCLB 
& NODIN 6& NOSDA & NORCC & LCV 

& NOSTR & MWAMWB & NOFTCH S NOCIB 
& RCLMUL & NOLIN 

Rl1  , Ri1 , NOOP , ADD , DA 
& SDMA & YMA & READ 

& ‘LAD & NORMM & LMM & CIRLO 
& DINHI & NOSDA & NORCC §& LCCLCV 
& NOSTR & MWAMWR & NOFTCE & NOCIB 
& RCLMUL & NOLIN 

R1@  , R1@ , RAMF , SUBS , DA 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & NOLCC 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO » Re , RAMF , SUBR , ZB 
& NOSDMA & NOVMA & READ 

& LAD & NORMM & LMM & NOCLB 
& NODIN & KOSDA & NORCC & LCCLCV 
& NOSTR & RDRD & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO » RO , RAMA , ADD , ZB 
& NOSDMA & VMA_ & WRITE 

& NOLAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LCCLCV 
& NOSTR & RSKS  & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO , R® , RAMA , ADD, ZA 
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/ NOINE & SDMA & VMA  & READ / NOINE & SDMA & VMA & READ 
/ NOBA & LAD & NORMM & LMM & NOCLB / NOBA & LAD & NORMM & LMM & NOCLB 
/ NOSWP & NODIN & NOSDA & NORCC & LCCLCV / NOSWP & NODIN & NOSDA & NORCC & LCCLCV 
/ LDI & NOSTR & RSRS & NOFTCH & NOCIB / LDI & NOSTR & RSRS & NOFTCH & NOCIB 
/ CIA & RCLMUL & NOLIN / CIA & RCLMUL & NOLIN 
MVNIN: BRANCH IFETCH & AM2901 RO » RO » RAMF , OR » DZ ADCIN: BRANCH AFETCH AM2991 RO , RO ,» RAMF , ADD , DA 
/ NOINE & NOSDMA & NOVMA & READ / NOINE & NOSDMA & NOVMA & READ 
/ BA & NOLAD & NORMM & LMM & NOCLB / BA & NOLAD & NORMM & LMM & NOCLE 
/ NOSWP & DINHL & NOSDA & NORCC & LCCLCV / NOSWP & DINHL & NOSDA & NORCC & NOLCC 
/ NOLDI & NOSTR & RSRD & NOFTCH & NOCIB / NOLDI & NOSTR & RDRD & NOFTCH & CIB 
/ NOCIA & RCLMUL & NOLIN / NOCIA & RCLMUL & NOLIN 
MVNI+R: CONTNUE AM2961 RO » RB » RAMA , ADD, ZA ADCI+R: CONTNUE AM2961 R@ » RG » RAMA , ADD, ZA 
/ NOINE & NOSDMA & VMA & READ / NOINE & NOSDMA & VMA & READ 
/ NOBA & LAD & NORMM & LMM & NOCLB / NOBA & LAD & NORMM & LMM & NOCLB 
/ NOSWP & NODIN & NOSDA & NORCC & LCCLCV / NOSWP & NODIN & NOSDA & NORCC & LCCLCV 
/ LDI & NOSTR & RSEBS & NOFTCH & NOCIB J LDI & NOSTR & RSRS & NOFTCH & NOCIB 
/ CIA & RCLMUL & NOLIN / CIA & RCLMUL & NOLIN 

BRANCH MVNIN AM2981 RO » RO » NCOP , OR » DZ BRANCH ADCIN AM2901 6 » RO » NOOP , OR , DZ 
/ NOINE & SDMA & VMA_ & READ / NOINE & SDMA & VMA  & READ 
/ NOBA & LAD & NORMM & LMM & NOCLB / NOBA & LAD & NORMM & LMM & NOCLE 
/ NOSWP & DINHL & NOSDA & NORCC & LCCLCV / NOSWP & DINHL & NOSTA §& NORCC & LCCLCV 
/ LDI & NOSTR & MWAMWB & NOFTCH & NOCIB / LDI & NOSTR & MWAMWB & NOFTCH & NOCIB 
/ NOCIA & RCLMUL & NOLIN / NOCIA & RCLMUL & NOLIN 
MVNZR:  CONTNUE AM2901 R15 , R15 , RAMA , ADD , 2A ADCZR:  CONTNUE AM29@01 R15 , R153 , RAMA , ADD , ZA 
/ NOINE & NOSDMA & VMA_ & READ / NOINE & NOSDMA & VMA_ & READ 
/ NOBA & LAD & NORMM & LMM & NOCLB / NOBA & LAD & NORMM & LMM & NOCLB 
/ NOSWP & NODIN & NOSDA & NORCC & LCCLCV / NOSWP & NODIN & NOSDA & NORCC & LCCLCV 
/ LDI & NOSTR & MWAMWB & NOFTCH & NOCIB / LDI & NOSTR & MWAMWB & NOFTCH & NOCIB 
/ CIA & RCLMUL & NOLIN / CIA & RCLMUL & NOLIN 

BRANCH MVNIN AM2991 RO » RB » NOCP , ADD , DA BRANCH ADCIN AM2901 RO , RO » NOOP , ADD, DA 
/ NOINE & SDMA & YMA & READ NOINE & SDMA & VMA & READ 
/ NOBA & LAD & NORMM & LMM & NOCLB NOBA & LAD & NORMM & LMM & NOCLB 
/ NOSWP & DINHL & NOSDA & NORCO & LCCLCV NOSWP 6& DINHL & NOSDA & NORCC & LCCLCV 
/ LDI & NOSTR & RSRD & NOFTCH & NOCIB LDI & NOSTR & RSRS & NOFTCH & NOCIB 
/ NOCIA 6& RCLMUL & NOLIN NOCIA & RCLMUL & NOLIN 
MOVM+R: CONTNUE AM29@1 RO » RO » RAMA , ADD =, ZA UBM+R: CONTNUE AMZ2981 R@ » RO » RAMA , ADD, 2A 
/ NOINE & SDMA & YMA & READ NOINE & SDMA & VMA  & READ 
/ NOBA & LAD & NORMM & LMM & NOCLB NOBA & LAD & NORMM & LMM & NOCLB 
/ NOSWP & NODIN & NCSDA & NORCC & LCCLCV NOSWP & NODIN & NOSDA & NORCC & LCCLCV 
/ LDI & NOSTR & RSRS-~ & NOFTCH & NOCIB LDI & NOSTR & RSRS & NOFTCH & NOCIB 
/ CIA & RCLMUL & NOLIN CIA & RCLMUL & NOLIN 
MOVIN: BRANCH IFETCH & AM2901 RO » RO » RAMF , OR » D2 UBIN: BRANCH IFETCH & AM2901 R@ » RO » RAMF , SUBR , DA 


NOINE & NOSDMA & NOVMA & READ 

BA & NOLAD & NORMM & LMM & NOCLB 
NOSWP & DINHL & NOSDA & NORCC & NOLCC 
NOLDI & NOSTR & RDRD & NOFTCH & NOCIB 
CIA & RCLMUL & NOLIN 


NOINE & NOSDMA & NOVMA & READ 

BA & NOLAD & NORMM & LMM & NOCILB 
NOSWP & DINHL & NOSDA & NORCC & LCV 
NOLDI & NOSTR & RSRD & NOFTCR & NOCIB 
NOCIA & RCLMUL & NOLIN 


AM2981 RO » RO RAMA , ADD » 2A 
NOINE & NOSDMA & VMA & READ 

NOBA & LAD & NORMM & LMM & NOCLB 
NOSWP & NODIN & NOSDA & NORCC & LCCLCY 
LDI & NOSTR & RSRS & NOFTCH & NOCIB 
CIA & RCLMUL & NOLIN 


AM2961 R® + RB » NOOP , OR » DZ 


OVI+R: CONTNUE AM29@1 R®@ » RO » PAMA , ADD » ZA UBI+R: CONTNUE 
NOINE & NOSDM4 & VMA & READ 

NOBA & LAD & NORMM & LMM & NOCLB 

NOSWP & NODIN & NOSDA & NORCC & LCCLCV 
LDI & NOSTR & RSRS & NOFTCH & NOCIB 


CIA & RCLMUL & NOLIN 
AM29801 R@ + RO » NOCP , OR » D2 BRANCH SUBIN 


~n™N NR ~~ EN ~N NNN ~~ RS 


BRANCH MOVIN 


~~ NN SS MN NR RRR RRS 
_ 
2 
oe 


NOINE & SDMA & VMA  & READ / NOINE & SDMA & VMA- & READ 
NOBA & LAD & NORMM & LMM & NOCLB / NOBA & LAD & NORMM & LMM & NOCLB 
NOSWP & DINHL & NOSDA & NORCC & LCCLCV / NOSWP & DINHL & NOSDA & NORCC & LCCLCV 
LDI & NOSTR & MWAMWE & NOFTCF & NOCIB / LDI & NOSTR & MWAMWB & NOFTCH & NOCIB 
NOCIA & RCLMUL & NOLIN / NOCIA & RCLMUL & NOLIN 

OVZR:  CONTNUE AM29@1 R15 , R15  , RAMA , ADD =, ZA SUBZR: CONTNUE AM2991 R15  , R15  , RAMA , ADD , ZA 
NOINE & NOSDMA & VMA & READ / NOINE & NOSDMA & VMA  & READ 
NOBA & LAD & NORMM & LMM & NOCLB / NOBA & LAD & NORMM & LMM & NOCLB 
NOSWP & NODIN & NOSDA & NORCC & LCCLCV / NOSWP & NODIN & NOSDA & NORCC & LCCLCV 
LDI & NOSTR & MWAMWB & NOFTCH & NOCIB / LDI & NOSTR & MWAMWB & NOFTCH & NOCIB 
CIA & RCLMUL & NOLIN / CIA & RCLMUL & NOLIN 

BRANCH MOVIN AM2981 RO » RB » NOOP , ALD , DA BRANCH SUBIN AM2981 RO » R@ , NOOP , ADD , DA 


NOINE & SDMA & YMA & READ 

NOBA  §& LAD & NORMM & LMM & NOCLB 

NOSWP & DINEL & NCSDA & NORCC & LCCLCV 
LDI & NOSTR & RSRS & NOFTCH & NOCIB 

NOCIA & RCLMUL & NOLIN 


NOINE & SDMA & VMA & READ 

NOBA & LAD & NORMM & LMM & NOCLB 

NOSWP & DINHL & NOSDA & NORCC & LCCLCV 

LDI & NOSTR & RSRD & NOFTCH & NOCIB 

NOCIA & RCLMUL & NOLIN / 


AM2901 R® » RO » RAMA , ADD » ZA 
NOINE & SDMA & VMA & READ 

NOBA & LAD & NORMM & LMM & NOCLB 

NOSWP & NODIN & NOSDA & NORCC & LCCLCYV 
LDI & NOSTR & RSRS & NOFTCH & NOCIB 

CIA & RCLMUL & NOLIN 


DDM+R: CONTNUE AM29@1 = R® » RB » RAMA, ADD » ZA UBCM+R: CONTAUE 
NOINE & SDMA & VMA & READ 

NOBA & LAD & XORMM & LMM & NOCLB 
NOSWP & NODIN & NOSDA & NORCC & LCCICV 
LDI & NOSTR & RSRS & NOFTCH & NOCIB 


CIA & RCLMUL & NOLIN 


DDIN: BRANCH IFETCH AM2901 = R® » RB » RAMF  , ADD » DA UBCIN: BRANCH IFETCH AM29@1 R@ » RB » RAMF , SUBR , DA 
NOINE & NOSDMA & NOVMA & READ NOINE & NOSDMA & NOVMA & READ 
BA & NOLAD & NORMM & LMM & NOCLB BA & NOLAD & NORMM 6& LMM & NOCLB 


NOSWP & DINHL & NOSDA & NORCC & NOLCC 
NOLDI & NOSTR S&S RDRD & NOFTCH & CIB 
NOCIA & RCLMUL & NOLIN 


NOSWP & DINHL & NOSDA & NORCC §& NOLCC 
NOLDI & NOSTR & RDRD & NOFTCH & NOCIB 
NOCIA & RCLMUL & NOLIN 


AM2901 R@ » RO RAMA , ADD » 2A 
NOINE & NOSDMA & VMA & READ 

NOBA & LAD & NORMM & LMM & NOCLB 
NOSWP & NODIN & NOSDA & NORCC & LCCLCV 
LDI & NOSTR & RSRS & NOFTCH & NOCIB 
CIA & RCLMUL & NOLIK 

AM29@1 k@ » RB » NOCP , OR » D2 


DDI+R: CONTNUE AM2901 R@ » RE RAMA, AD? » ZA UBCI+R: CONTNUE 
NOINE & NOSDMA & VMA_ & READ 

NOBA & LAD & NORMM & LMM & NOCLE 
NOSWP & NODIN & NOSDA & NORCC & LCCLCV 
LDI & NOSTR & RSRS & NOFTCH & NOCIB 


CIA & RCLMUL & NOLIN 


: 
° 


~ NON. ~NN ~~ ONS. ope ~ NR 
~~ NNN ~NN NNO ~ NN NN OD ~S NO TS 


BRANCH SUBCIN 


PP POMMAM MHMHMMMM MOM AMAMM MMMM MM MMMMMM MMMM MM MMMMMM AMMO MM MMMMMM MHMMMM MHMMMM MMMMMM® MMMM AMM MMMMMM MOMMAM MHMMMO MMM MMM MMMM 
AMP MMMM MM MOM MMM MAM MHMMMMMMM SAHMMMM MMMMMAM MAMMMM MMM MMM MM MMMM MMMM MM MAM MMM MMMM MM MMMM OM MAMMMMM MMMAMOM MMMM MM MOM MMM MM MMM 


BRANCH ADDIN AM2901 © » RB » NOOP , OR » DZ / NOINE & SDMA & VMA & READ 
/ NOINE & SDMA & VMA  & READ / NCBA & LAD & NCRMM & LMM & NCCLB 
/ NOBA  & LAD & NORMM & LMM & NOCLB / NOSWP & DINhL S NOSDA & ANCRCC & LCCLCV 
/ NOSWP §S DINHL & NOSDA & NORCC & LCCLCY / LDI & NOSTR & MWAMWE & NOFTCE & AOCIB 
/ LDI & NOSTR & MWAMWB & NOFTCH & NOCIB.,. / NOCIA & RCLMUL & NOLIN 
/ NOCIA & RCLMUL & NOLIN 
SUBCZR: CONTNUE AM29@1 R15  , R15 » RAMA , ADL » ZA 
ADD2R: CONTNUE AM29861 R15 » RIS » RAMA =, ADD + ZA / NOINE & NOSDMA & VMA & READ 
/ NOINE & NOSDMA & VMA_ & READ / NOBA & LAD & NCRYM & LMM & NOCLB 
/ NOBA & LAD & NORMM & LMM & NOCLE / NOSWP & NODIN & NOSDA & NORCC § LCCLCV 
/ NOSWP & NODIN & NOSDA & NORCC & LCCLCV VA LDI & NOSTR & MBAMWB & NOFTCH & NOCIB 
/ LDI & NOSTR & MWAMWB & NOFTCH & NOCIE / CIA & RCLMUL & AOLIN 
/ CIA & RCLMUL & NOLIN 
BRANCH SUBCIN AM2901 R@ » RB » NOCP , ALD » DA 
BRANCH ADDIN AM29861 RO » RO » NOCP , ADD » DA / NOINE & SDMA & VMA & READ 
/ NOINE & SDMA & \VMA & READ / NOBA & LAD & NORMM & LMM & NCCLBE 
/ NOBA & LAD & NORMM & LMM & NOCLB / NOSWP & DINHL & NOSDA & NORCC & LCCLCV 
/ NOSWP & DINEL §& NOSDA & NORCC §& LCCLCOV / LDI & NCSTR & RSKS & NOFTCK & NCCIB 
/ LDI & NOSTR & RSKS & NOFTCH & NOCIB / NOCIA & RCLMUL & NOLIN 
/ NOCIA & RCLMUL & NOLIN 
ANDM+R: CONTNUE AM2901 R@ » RO » RAMA, ADL Paes 
ADCM+R: CONTNUE AM2901 RO » RB » RAMA , ADD 2 ZA / NOINE & SDMA & VMA & READ 
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/ 
/ 
/ 
/ 
yaaa BRANCH IFETCH 
/ 
/ 
/ 
/ 
ANDI+R: CONTNUE 
/ 
/ 
/ 
/ 
/ 
BRANCH ANDIN 
/ 
/ 
/ 
/ 
/ 
ANDZR: CONTNUE 
/ 
/ 
/ 
/ 
/ 
BRANCH ANDIN 
/ 
/ 
/ 
/ 
/ 
IORM+R: CCNTNUE 
/ 
/ 
/ 
/ 
/ 
IORIN: BRANCH IFETCH 
/ 
/ 
/ 
/ 
/ 
TORI+h: CONTNUE 
/ 
/ 
/ 
/ 
/ 
BRANCH IORIN 
/ 
/ 
/ 
/ 
/ 
pore CONTNUE 
/ 
/ 
/ 
/ 


BRANCH IORIN 


ORM+R: 


CONTNUE 


ORIN: 


BRANCH IFLTCH 


CRI+PF: CGONTNUE 


~~ o NNR S ~S™N RA NB ~~wN NS 


BRANCE XORIN 


ORZR: CONTNUE 


~~ NL, ~~ 


BRANCH XORIN 


MPM+R: CONTNUE 


~aQ NNN 


MPOMMMM MOMS MM MOPOMOOM MONMMMM MAMMMM MOMMMM MMNMMMM MAMMMMH MMMM MM MMMMAAM MOHMMMM MMOMMAMM MA MMH 


PP MMMM MM MAM OM MAIMMMM MMMM MH MOM MAM 


NOBA 
NOSWP 
LDI 
CIA 


AM29@1 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NCSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2991 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NCSWP 
LDI 
CIA 


AM2901 
NOINE 
BA 
hOSWP 
NOLDI 
NCCIA 


AM2921 
NOINE 
NOBA 
NCSwP 
LDI 
CIA 


AM2901 
NCINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2981 
NOINE 
BA 
NOSWP 
NOLDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2991 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCTA 


AM2901 
NOINE 


& LAD & NORMM & LMM & 
& NODIN & NCSDA & NORCC & 
& NOSTR & RSRS & NOFTCH & 
& RCLMUL & NOLIN 

RO , Ro , RAMP, AAD 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & 
& DINHL & NOSDA & NORSC & 
& NOSTR & RDRD & NOFTCH & 
& RCLMUL & NOLIN 

RO , kB , RAMA , ADD 
& NOSDMA & VMA & READ 

& LAD & NORMM & LMM 6 
& NODIN & NOSDA & AORCC & 
& NOSTR & RSRS S&S NOFTCH & 
& RCLMUL & NOLIN 

RO , RO , NOOP , OR 
& SDMA & VMA & READ 

& LAD & NORMM & LMM & 
& DINKEL & NOSDA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R15, R15, RAMA, ADD 
& NOSDMA & VMA  & READ 

& LAD & NORMM & LMM é 
& NODIN & NCSDA & XORCC & 
& NCSTR & MWAMWB & NCFTCH & 
& RCLMUL & NOLIN 

RO , Ro , NOCP , ADD 
& SDMA & YMA & READ 

& LAD S NCRMM §& LMM & 
& DINHL & NCSDA & NORCC & 
& NCSTR & RSPS  & NOFTCE & 
& RCLMUL & NOLIN 

RO , RO , RAMA , ADD 
& SDMA & VMA & RFAD 

& LAD & NORMM & LMM & 
& NODIN & NOSDA & NORCO & 
&§ NOSTR S$ RSKS  & NOFTCH & 
& RCLMUL & NOLIN 

RO , Re , RAMF , OR 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & 
& DINHL & NOSDA & NORCC & 
& NCSTR & RDRD & NCFTCH & 
& RCLMUL & NOLIN 

RO , R@ , RAMA , ADD 
& NOSDMA & VMA_ & READ 

& LAD & NORMM & LMM & 
& NODIN & NOSDA & NORCC & 
& NOSTR & RSRS  & NOFTCH & 
& RCLMUL & NOLIN 

RO , RO , NOCP , OR 
& SDMA & VMA & READ 

& LAD & NORMM & LMM iS 
& DINKL & NCSDA & NORCC & 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

R15 , R15, RAMA, ADD 
& NOSDMA & VMA & READ 

& LAD & NORMM & LMM & 
& NODIN & NOSDA & NORCC 6 
& NOSTR & MWAMbB & NOFTCH & 
& RCLMUL & NCLIN 

RO , Re , NOOP , ADD 
& SDMA & VMA  & READ 

& LAD & NORMM & LMM & 
& DINHL & NOSDA & NORCC 6 
& NOSTR & RSRS & NOFTCH & 
& RCLMUL & NOLIN 

RO » RB » RAMA, ADD 
& SDMA & VMA  & READ 

& LAD & NORMM & LMM & 
& NODIN & NOSDA & NORCC & 
& NOSTR & RSRS §& NOFTCH & 
& RCLMUL & NOLIN 

RO , Re , RAMF , EXOR 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & 
& DINHL & NOSDA & NORCC & 
& NOSTR & RDRD & NOFTCH & 
& RCLMUL & NOLIN 

RO , R@ , RAMA , ADD 
& NOSDMA & VMA & PEAD 

& LAD & NORVM §& LMM & 
& NODIN & NOSDA §& NORCC & 
& NOSTR & RSRS  & NOFTCH & 
& RCLMUL & NCLIN 

RO , RB , NOCP , OR 
& SDMA & VA & READ 
& LAD S NORMM & LMM & 
& DINEL & NCSDA & NCRCC & 
& NOSTR & MWAMBB & NOFTCE & 
& RCLMUL & NCLIN 

R15, R15, RAMA, ADD 
& NOSDMA & VMA  & READ 
& LAD & NORMM & LMM & 
& NODIN & NOSDA & NORCC § 
& NOSTR & MWAMWB & NOFTCH & 
& RCLMUL & NOLIN 

Re , R@ , NOOP , ADD 
& SDMA & VMA & READ 
& LAD & NORMM & LMM & 
& DINHL §& NOSDA & NORCC & 
& NOSTR & RSRS  & NOFTCH & 
& RCLMUL & NOLIN 

RG » RB , RAMA , ADD 
& SDMA & VMA & READ 


NOCLB 
LCCLCV 
NCCIE 


» DA 


NCCLE 
LCV 
NOCIB 


» ZA 


NOCLB 
ICCLCV 
NCCIB 


» 02 


NOCLB 
LCCLCV 
NCCIB 


» ZA 


KOCLB 
LCCLCV 
NCCIB 


» DA 


NOCLB 
LCCLCV 
NOCIB 


» ZA 


NOCLB 
LCCLCV 
NOCIB 


» DA 


NOCLB 
ICV 
NOCIB 


» ZA 


NOCLB 


LCCLCV 
NOCIB 


» DZ 


NOCLB 
LCCLCV 
NOCIB 


NOCLB 
LCCLCV 
NOCIB 


NOCLB 
LCCLCV 
NOCIB 


» 2A 
NCCLB 
LCCLCV 
NOCIB 

» DA 
NOCLB 
LCV 
KOCIB 

, ZA 
NOCLB 
LCCLCV 
NCCIF 

» DZ 
NOCLB 


LCCLCV 
NCCIB 


» ZA 
NOCLB 
LCCLCV 
NCCIB 

» DA 
NOCIB 


LCCLCV 
NCCIB 


, ZA 


/ 
/ 
/ 
/ 
CMPIN: BRANCH IFETCH 
/ 
/ 
/ 
/ 
/ 
CMPI+R: CONTNUE 
/ 
/ 
/ 
/ 
/ 
BRANCH CMPIN 
/ 
/ 
/ 
/ 
/ 
CMPZR: CONTNUE 
/ 
/ 
/ 
/ 
/ 
BRANCH CMPIN 
/ 
/ 
/ 
/ 
/ 
RSBM+R: CONTNUE 
/ 
/ 
/ 
/ 
/ 
pee BRANCH IFETCH 
/ 
/ 
/ 
/ 
HSBI+K: CONTNUE 
/ 
/ 
/ 
/ 
/ 
BRANCH RSUBIN 
/ 
/ 
/ 
/ 
/ 
RSUBZR: CONTNUE 
/ 
/ 
/ 
/ 
/ 
BRANCH RSUBIN 
/ 
/ 
/ 
/ 
/ 
INCMR: CONTNUE 
/ 
/ 
/ 
/ 
/ 
INCOUT: BRANCH IFETCH 
/ 
/ 
/ 
/ 
/ 
INCI+R: CONTNUE 
/ 
/ 
/ 
/ 
/ 
BRANCH INCOUT 
/ 
/ 
/ 
/ 
/ 
INCZR: CONTNUE 
/ 
/ 
/ 
/ 
/ 
BRANCH INCOUT 
/ 
/ 
/ 
/ 
/ 
DECMR: CONTNUE 
/ 


MM MOMMMM MAM MMM MMMMMM MMO MHMM MAHMMMOM SMMOMMMH MMMMMM MAHMMMMHM MAMHOM MOMM MM MMMM POMPOM M POMPOM MM KMHOMMMM MMMMMM MMAMMAM MAMMMM MHAMO 


NOBA 
NCSWP 
LDI 
CIA 


AM2901 
NOINE 
BA 
NCSWP 
NOLDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AMZS@1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
BA 
NOoWP 
NOLDI 
CIA 


AM2921 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2961 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOCSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
NOLDI 
CIA 


AM2S01 
NOINE 
NOBA 
NOSwWP 
LDI 
CIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 


NOINE & NOSDMA & VMA 


& LAD & NORMM = & LMM & NCCLB 
& NODIN & NOSDA & NORCC §& LCCLCV 
& NOSTR & RSRS  & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO , RO » NOOP , SUBR , DA 
& NOSDMA & NOVMA & RFAD 
& NOLAD & NORMM §& LMM & NOCLB 
& DINHL & NOSDA & KCRCC & AOLCC 
& NOSTR & RDKD & NOFTCH & ACCIB 
& RCLMUL & NOLIN 

RO , RO , RAMA , ADD, ZA 
& NOSDMA & WMA & READ 
& LAD & NORMM & LMM & NCCLB 
& NODIN & NOSDA & RORCC & LCCLCV 
& NOSTR S RSRS  §& NOFTCH & NCCIE 
& RCLMUL & NOLIN 

RO , RO , NOOP , OR , 02 
& SDMA & VMA  & READ 
& LAD & NORMM & LMM & NCCLB 
& DINHL §& NOSDA & NORCO & LCCLCV 
S$ NOSTR & MWAMWB & NOFTCH S NCCIB 
& RCLMUL & NOLIN 

R15 , R15  , RAMA , ADD, ZA 
& NOSDMA & VMA 4 READ 

& LAD & NCRMM & LMM & NOCLB 
& NODIN & NOSEA & NORCC & LCCLCV 
& NCSTR & MWAM¥B & NOFTCH & ACCIB 
& RCLMUL & NCLIN 

RO » RO , NOOP , ADD , DA 
& SDMA & VMA & READ 
& LAD & NORMM & LMM & NOCLB 
& DINPL & NOSDA %& NORCC §& LCCLCV 
& NOSTR 6& RSRS  & NOFTCH & NOCIB 
& KCLMUL & NOLIN 

RO » RO , RAMA , ADD, ZA 
& SDMA & VMA & READ 

& LAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NOKCC & LCCLCV 
& NOSTR & RSRS & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO , RO , RAMF , SUBS , DA 
& NOSDMA & NOVMA & READ 

& NOLAD & NORMM & LMM & NOCLB 
& DINHL & NCSTA & KORCC & NOLCC 
& NOSTR & RDRD & NOFTCH & NCCIB 
& RCLMUL & NOLIN 

Re , RB , RAMA , ADD, 2A 
& NOSDMA & VMA  & READ 

& LAD & NOKMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LCCLCV 
& NOSTR & RSRS & NCFTCH & NCCIB 
& RCLMUL & NCLIN 

RQ » Re » NOOP , OR , v2 
& SDMA & VMA & READ 

& LAD & NORMM & LMM & NOCLB 
& DINEL & NOSDA & NORCO & LCCLCV 
& NCSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R15 , R15  , RAMA , ADD , ZA 
& NOSDMA & VMA  & READ 

& LAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
S RCLMUL & NOLIN 

RO » RO » NOCP , ADD , DA 
& SDMA & VMA & READ 

& LAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & LCCLCV 
& NOSTR & RSPS & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO , RB » RAMA , ADD, ZA 
& NOSDMA & VMA  & READ 

& LAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCO & LCCLCV 
& NOSTR & RSRS  & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO » RO , RAMF , ADD, DZ 
& NOSDMA & VMA & WRITE 

& NOLAD & NORMM & LMM & NOCLB 
& DINFL S& NCSDA §& NORCC & LCV 

& NOSTR & RDKD & NOFICH & NOCIB 
& RCLMUL & NOLIN 

RO , RO ,» RAMA , ADD, ZA 
& NOSDMA & VMA  & READ 

& LAD & NCRMM & LMM & NOCLB 
& NODIN & NOSDA & ACRCC & LCCLCV 
& NOSTR & RSRS  & NOFTCH & NOCIB 
& RCLMUL & NCLIN 

RO , RB , NOOP , OR (De 
& NOSDMA & YMA & READ 

& LAD & NORMM 6& LMM 6 NOCLB 
& DINKL & NOSDA & NCRCC & LCCLCV 
& NOSTR & MbAMWB & NOPTCH & NOCIB 
& RCLMUL & NOLIN 

R15 , R15  , RAMA , ADD , ZA 
& NOSDMA & WMA & RAD 

& LAD & NORMM & LMM & NOCLE 
& NODIN & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NCLIN 

RO , RB , NCOP , ADD , DA 
& NOSDMA & WMA & READ 

& LAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA % NORCC & LCCLCV 
& NOSTR 6& RSRS 6& NCFTCH & NCCIB 
& RCLMUL & NOLIN 

Re » RO , RAMA , ADD , ZA 


& READ 


/ 

/ 

/ 

/ 

DECOUT: BRANCH IFETCH 
/ 

/ 

/ 

/ 

/ 

pote? CCNTNUE 
/ 

/ 

/ 

/ 

BRANCH CECOUT 

/ 

/ 

/ 

/ 

/ 

DECZR: COATNUE 
/ 

/ 

/ 

/ 

/ 


BRANCH CECOUT 


OMVR: 


CONTNUE 


OMOLT: 


BRANCH IFETCH 


OMI+R: CONTNUE 


~N NNN OO ~NNN NO ~~ NNN OD nnn NS 


BRANCH COMOUT 


/ 

/ 

/ 

/ 

/ 

COMZR: CONTNUE 
/ 

/ 

/ 

/ 

/ 

BRANCH COMCUT 

/ 

/ 

/ 

/ 

/ 

NEGMR: CONTNUE 
/ 

/ 

/ 

7 

/ 

NEGOUT: BRAACH IFETCH 
/ 

/ 

/ 

/ 

/ 

yeaa CONTNUE 
/ 

/ 

/ 

/ 


BRANCH NEGOUT 


CONTNUE 


SP ~~ 
i?) 
Nn 
Lo] 
oe 


BRANCH NEGOUT 


WPMR: CONTNUE 


~N mo ~~ NS 
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NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NOINE 
NOBA 
NOSWP 
NOLDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSwP 
LDI 
CIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 
AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NOINE 
NOBA 
NOSWF 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
NOLDI 
NOCIA 


AM2991 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM 2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
NCINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AMZ9O1 
NOINE 
NOBA 
NCSWP 
NOLDI 
CIA 


AMZ2901 
NOINE 
NCBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


AM29@1 
KOINE 
NOBA 


& LAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA S NORCC & LCCLCV 
& NOSTR & RSRS & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO » RB » RAMF  , SUPBS , DZ 
& NOSDMA & VMA & WRITE 
& NOLAD S NCRMM & LMM & NOCLB 
& DINHL € ROSDA & NOFCC & LCV 

& NOSTR & RDAL &S NOFTCH & NOCIB 
& RCLMUL & NOLIN 

Ro » RO » RAMA , ADD » ZA 
& NOSDMA & VMA & RKFAD 
& LAD & NORMM & LMM & nCCLB 
& NODIN & NCSDA 6& NORCC & LCCLCV 
& NOSTR §& RSRS & NOFTCE & NOCIF 
& RCLMUL & NOLIN 

Re » RB » NOCP , OR » DZ 
S NOSDMA & VMA & RPAD 
& LAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & LCOCLCV 
& NOSTR & MWAMWB & NCFTCH & NOCIB 
& RCLMUL & NOLIK 

R15 » R15 » RAMA , ADD » ZA 
& NOSDMA & VMA & READ 

& LAD & NORMM §& LMM & NOCLB 
& NCDIN & NOSDA & NCKCC & LCCLCV 
& NOSTR 4& MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO » RO » NOOP , ADD » DA 
& NOSDMA & VMA & READ 
& LAD & NORMM & LMM & NOCLB 
& DINRL & NOSDA & NCRCC & LCCLCV 
& NOSTR & RSRS & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R@ » RO » RAMA , ADD » ZA 
& NOSDMA & VMA & READ 

& LAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA §& NORCC & LCCLCV 
& NOSTR & RSRS & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO » RO » RAMF , EXNOR , DZ 
& NOSDMA & VMA & WRITE 

& NOLAD & NORMM & LMM & NOCLB 
& DINHL & NCSDA 6&6 NORCC & LCV 

& NOSTR & RDRD & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

Ro » Re » RAMA , ADD » ZA 
& NOSDMA & WMA & READ 

& IAD & NOPMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LCCLCV 
& NOSTR & RSRS & NOFTCH & NOCIE 
& RCLMUL & NOLIN 

RO » RO » NOCP , OR » DZ 
& NOSDMA & VMA & READ 
& LAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

R15 » R1I5 » RAMA , ADD » ZA 
& NOSDMA & VMA & READ 

& LAD & NORMM & LMM & NOCLB 
& NODIN & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RQ » RO » NCOP , ADD >» DA 
& NOSDMA & VMA & READ 
& LAD & NORMM 6& IMM & NOCLB 
& DINHL §& NOSDA & NORCC §& LCCLCV 
& NOSTR & RSRS & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

Rg » R@ » RAMA , ADD » ZA 
& NOSDMA & VMA & READ 

& LAD & NORMMK & LMM & NOCLB 
& NODIN §& NOSDA & NORCC & LCCLCV 
& NOSTR & RSRS & NOFTCH & NOCIF 
& RCLMUL & NOLIN 

RG » R® » RAMF , SUBR , DZ 
& NOSDMA & VMA & wWhITE 
& NOLAD & NORMM § LMM & NCCLB 
& DINPL & NOSDA & NORCC & NCLCC 
& NOSTR & RORT § NOFTCH & NOCIE 
& RCLMUL % NOLIN 

Rd » RO » RAMA , ADD » ZA 
& NOSDMA & VMA & RFAD 
& LAD & NORMM & LMM & NCCLB 
& NCDIN & NOSDA 6 NORCC & LCCLCV 
& NOSTR & RSKS & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

RO » RO » NCOP , OR » LZ 
& NOSDMA & VMA & READ 
& LAD & NORMM & LMM & NOCLB 
& DINHL & NOSDA & NORCC & LCCLCV 
& NOSTR & MWAMWE & NOFTCH $ NOCIB 
& RCLMUL & NOLIN 

R15 » R1iS » RAMA , ADD » ZA 
& NOSDMA & VMA & READ 
& LAD & NORMM & LMM & NCCLE 
& NODIN & NOSDTA & NORCC & LCCLCV 
& NOSTR & MeAMbB & NOFTCH & NOCIB 
& RCLMUL & NOLIN 

Re » RE » NOOF , ADD » DA 
& NOSDMA & VMA & READ 

& LaD & NORMM & LMM & NOCLB 
& DINFL & NOSDA & NORCC & LCCLCV 
& NOSTR & RSRS & NCFTCH & NCCIE 
& RCLMUL & NOLIN 

RG » RB » RAMA , ADD » ZA 
& NOSDMA & VMA & READ 

& LAD & NORMM & LMM & NOCLB 


WPOUT: 


WPI+R: 


~NNNO ~N RN D ~ oS 


~N NS 


END 


BRANCE IFETCH 


CONTNUE 


BRANCH SWPOUT 


CCATNUE 


BRANCH SWPOUT 


POMMMM MAMM MM MMMAMM a ‘4m‘al 


PMMM MM MMM MMM 


NOSWF & NODIN & NOSTA 


LDI 
NOCIA 


AM2901 
NOINE 
NOBA 
SWPHL 
NOLDI 
NOCIA 


AM2981 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM 2981 
NOINL 
NOBA 
NOSWP 
LDI 
NOCIA 


AM2921 
NOINE 
NOBA 
NOSWP 
LDI 
CIA 


AM2901 
NOINE 
NOBA 
NOSWP 
LDI 
NOCIA 


& NOSTR 
& RCLMUL 


Re ’ 
& NOSDMA 
& NOLAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ’ 
& NOSDMA 
& LaD 
& NODIN 
& NOSTR 
& RCLMUL 


R@ ’ 
& NCSDMA 
& LAD 
& DINHL 
& NOSTR 
& RCLMUL 


R15 - 
& NOSDMA 
& LAD 
& NODIN 
& NOSTR 
& RCLMUL 


RO ; 
& NOSDMA 
& LAD 
& DINHL 
& NOSR 
& RCLMUL 


& nORCC & 
& RSKS & NOFTCH & 
& NCLIN 
R@ » FAMF =, OR 
& VMA & WRITE 
& NORMM & LMM & 
& NOSTA 6& NORCC 6& 
& RDRD & NOFTCR & 
& NOLIN 
R@ » RAMA , ADD 
& VMA & READ 
& NORMM & LMM & 
& NCSDA & NORCC 6& 
S RSRS & NOFTCE & 
& NOLIN 
RO + NOOP , OR 
& VMA & READ 
& NORMM & LMM & 
& NOSDA & NORCC & 


& MWAMWE & NOFTCE & 


& NOLIN 
R15 » RAMA, ADD 
& VMA & READ 


& NORMM & LMM & 
& NOSTA & NORCC & 
& MWAMWB & NOFTCH & 
& NOLIN 


RO » NCOP , ADD 
& VMA & RFAD 

& NCRMM §& LMM & 
& NOSDA & NORCC 6 
& RSKS & NCFICH & 
& NOLIN 


LCCLCV 
NOCIB 


» DZ 


NOCLB 
LCV 
NOCIB 


» ZA 
NOCLB 
LCCLCV 
hOCIFE 

» [2 
NOCLB 
LCCLCV 
NOCIB 

» 2A 
NOCLB 
LCCLCV 
NOCIE 

» DA 
NOCLB 


LCCLCV 
NOCIB 
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Chapter IX 
Super Sixteen 


INTRODUCTION 


The AMD 16-Bit Computer design is an example of a high-speed 
microprocessor system which takes full advantage of AMD’s 
Am2900 Family of Bipolar microprocessor circuits to provide an 
economical, high performance, self contained 16-bit computer. 
It was designed to demonstrate the principles of a micropro- 
grammed system. 


This design ts intended to show some of the techniques used to 
achieve high performance. This inciudes pipelining at the micro- 
program level as well as pipelining at the macro or machine 
instruction program level. A powerful instruction set is demon- 
strated which allows the user to write efficient programs in a 
minimum amount of time. 


One of the unique features of the design Is that in addition to using 
the high performance Am2900 Bipolar microprocessor family, It 
takes advantage of the MOS peripherals normally associated 
with MOS microprocessors. These are used to perform the 
slower functions, particularly in the I/O interface area. 


SYSTEM ORGANIZATION 


The 16-Bit Computer ts designed to perform in a system environ- 
ment as shown in Figure 1. The system consists of a central 
processing unit (the 16-Bit Computer), memory units, I/O units 
(peripheral controllers), and a bus controller. These units com- 
municate over the system bus consisting of a 16-bit wide address 
bus, 16-bit wide bi-directional data bus, and a control bus. The 
control bus is a collection of signals that include the memory and 
/O interface controls and the interrupt request lines. 


BUS 
CONTROLLER 


ADDRESS BUS 
DATA BUS 


CONTROL BUS 


This organization allows systems to be configured with more than 
one CPU and multiple memory and I/O units. The bus controller 
arbitrates requests for bus use from the CPU’s or 1/O units that 
require DMA transfers. 


This application note concentrates on the design of the CPU 
portion of the system. 


INSTRUCTIONS 


An instruction ts either one or two 16-bit words in length and must 
be located in main memory on an integral word boundary The left 
most eight bits of the instruction is always the operation code, 
followed by two, 4-bit register designation fields (Figure 2). The 
16-bit (one word) instruction is always this format. The 32-bit (two 
words) instruction has the first (left most) word exactly like the 
16-bit instruction. The second word of the 32-bit instruction is 
always full 16-bit value (d) which acts as a memory reference 
address or an immediate value (Figure 3). This architecturally 
simple instruction format becomes very powerful when im- 
plemented on a microprogrammed machine. 


The 8-bit opcode provides for 256 primary instructions, which ts 
usually more than enough for most general purpose computers. 
The 4-bit register fields (R; and Ry) each designate one of the 
sixteen, 16-bit registers (Rg-R;5). Depending upon the operation, 
each register can act as either an accumulator for arithmetic and 
logic operations, or an index register In modulo address arithme- 
tic. On operations where the result is placed in a register, the R, 
field depicts the destination register and Ro (or Ro+d) Is, or points 
to the source field in main memory. On operations where the 


DEVICE 


/O 
einer INTERFACE 
UNIT 





Figure 1. System Organization. 


15 8 4 0 





Figure 2. 16-Bit Instruction (RR, RS, SS). 


result is transferred from a register to memory, the R, field depicts 
the source register and Ro (or Re+d) points to the destination 
memory location. Memory to memory transfers will have Ro as 
the source pointer and R, as the destination pointer. Even though 
the R; and Rg fields are architecturally wired to the Am2903 
register address inputs, variations of the source/destination as- 
signment may be implemented via microcode. 


The complete defined standard instruction set is given in Table 1. 
This ts a typical “machine level” instruction set. It allows manipu- 


opcove pom | me 16-BIT VALUE (¢) 


31 24 20 





16 


0 


Figure 3. 32-Bit Instruction (RX, RSI). 


319 


320 


FIXED-POINT LOAD/STORE INSTRUCTIONS 
RR, RS, SS, RX, RSI 


Table 1. 16-Bit Computer Instruction Summary Mnemonic Instruction Format. 


RS, RX 


RR, RS, SS, RX, RSI 


RR, RX 


RR, RS, SS, RX, RSI 


RR, RX 


RR, RS, SS, RX, RSI 
RR, RS, SS, RX, RSI 
RR, RS, SS, RX, RS! 


RSI 


RR, RS, SS, RX, RS! 
RR, RS, SS, RX, RSI 


RR, RX 
RR, RX 
RR, RX 


RR, RS, SS, RX, RSI 


RR, RX, RSI 
RR, RX, RSI 
RR, RX, RSI 
RR, RX, RSI 


RR, RX 


RR, RS, RX, RSI 
RR, RS, RX, RSI 
RR, RS, RX, RSI 


LD LOAD 

ST STORE 

FIXED-POINT ARITHMETIC INSTRUCTIONS 

ADD ADD 

ADC ADD WITH CARRY 

SUB SUBTRACT 

SBC SUBTRACT WITH 
CARRY 

AND AND 

OR OR 

XOR XOR 

TSTI TEST IMMEDIATE 

CMP COMPARE 

CMPL COMPARE LOGICAL 

MUL MULTIPLY 

MULU MULTIPLY UNSIGNED 

DIV DIVIDE 

COMP ONES COMPLEMENT 

BYTE INSTRUCTIONS 

LDB LOAD BYTE 

IC INSERT CHARACTER 

STC STORE BYTE 

XCHB EXCHANGE 

BS BYTE SWAP 

CLB COMPARE LOGICAL 
BYTE 

ANDB AND BYTE 

ORB OR BYTE 

XORB XOR BYTE 


SYSTEM INSTRUCTIONS 


LPSW 


SPSW 


EPSW 


SVC 
SETP 
RSTP 
TSTP 
CMPP 


LOAD PROGRAM 
STATUS WORD 

STORE PROGRAM 
STATUS WORD 

EXCHANGE PROGRAM 
STATUS WORD 

SUPERVISOR CALL 

SET BIT PSW 

RESET BIT PSW 

TEST BIT PSW 

COMPLEMENT BIT PSW 


STACK INSTRUCTIONS 
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TR TRANSLATE 
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ARITHMETIC 
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LOGICAL 
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ARITHMETIC 
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LOGICAL 
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ARITHMETIC 
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VO INSTRUCTIONS 
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BAL BRANCH AND LINK 
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lation of bit, byte, word and multibyte data; PUSH/POP single or 
multiple registers to/from stacks; maintain multiple stacks; deci- 
mal, binary and integer arithmetic; byte and word I/O; and 
maintain supervisory control over hardware and software gen- 
erated interrupts. 


Instruction Format 


Many of the instructions have multiple formats. These formats 
depict addressing modes and determine where the source and 
destination fields are located. The defined instruction formats are 
shown in Figure 4. 


INSTRUCTION 


The Program Control Unit 


The Program Control Unit (PCU) under control of the micropro- 
gram is used to update the Program Counter and load this value 
into the Memory Address Register (MAR) for reading instruc- 
tions/data from main memory. The PCU is also used to update the 
stack pointer and compare this value to the stack limits during 
stack operations. As can be seen in Figure 5, the Computer Block 
Diagram, data can be sent to the PCU from the ALU via the 
Transfer Register. The PCU can also output data onto the PCU 
bus to the Y-bus of the ALU via the bi-directional PCU transfer 
drivers. 


ADDRESSING MODE 


REGISTER REGISTER 


REGISTER STORAGE 


STORAGE TO STORAGE 





REGISTER INDEXED STORAGE 


REGISTER STORAGE IMMEDIATE 


Figure 4. Instruction Formats. 


The instructions set consists of nine instruction groups: 


~ Fixed-point load/store 
— Fixed-point arithmetic 
— Byte 

— Shift/rotate 

— Branch control 

=i 

— Stack 

~ Extended 

— System 


Acomplete description of each instruction is given in Appendix A. 


CENTRAL PROCESSING UNIT ARCHITECTURE 


Processor Organization 


The organization of the computer is shown In Figure 5 (Computer 
Block Diagram) The computer ts organized into several distinct 
sections, the Program Control Unit (PCU), the Arithmetic and 
Logic Unit (ALU), and the Computer Control Unit (CCU), the Data 
Path, the Memory Control and Clock Control, and Input/Output 
Interface and Interrupt Section The logic diagrams for the CPU 
are located in Appendix F. Earlier chapters in the Build a Micro- 
computer series have described the principle sections of a com- 
puter and the Am2900 components used in these sections. This 
chapter describes how these components are used to tmplement 
a very high-speed low cost computer. 


Note Figure 5 is sheet 1 of the logic diagrams 


The PCU ts organized around four Am2901’s. The use of 
Am2901’s allow the PCU to generate addresses with the flexibility 
of an ALU chip, to increment the Program Counter by two in one 
microcycle, and to provide the stack pointer registers for in main 
memory stack operations. The registers of these Am2901’s are 
defined as shown in Figure 6 Register 0 holds the program 
counter and Registers 4 and 5 hold constants for incrementing. 
Byte addressing requires the address to be incremented by two 
every time 16 bits of instruction data are fetched. 


The Arithmetic and Logic Unit (ALU) 


The ALU shown in Figure 7 is organized around four Am2903’s. 
The Am2903 performs all of the functions performed by the 
Am2901A but also provides the computer with separate DA bus 
and DB bus input ports as well as additional instructions to im- 
plement multiplication and division. Three major buses connect to 
the ALU. DA, DB and Y buses. The memory data from the Zp 
Register and microcode immediates are brought into the Am2903 
through the DA port while Program Status Bits 16-23 enter via the 
DB port. The Am2903’s output or receive data on the Y bus for 
loading into the RAM registers. The Am2903’s zero decode logic 
detects zero on the Y port whether or not the Y port is receiving or 
sending data. 


To implement the defined instruction set, the RAM register selec- 
tion controls are sent from the Instruction (I) Register to the 
Am2903’s. Ip.3 (used with instructions with the Ro or Xo field) are 
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Figure 6. PCU Register Assignments. 










connected to the A address inputs on the Am2903 while I4_7 are 
connected to the B address inputs. The ALU operations per- 
formed are controlled by microcode bits M7g_gg which are con- 
nected to the Am2903 lpo.g inputs. 


The Am2904 provides the microcode and machine status reg- 
isters holding the carry, negative, zero and overflow status The 
machine status bits C, N, Z and OVR are defined as PSW bits 
16-23. Logic in the Am2904 includes a condition code multiplexer 
to select the true or complement of any of the four status bits and 
combinations of status bits from either the machine or micro- 
status registers or directly from the ALU. This condition code 
multiplexer ts controlled by Instruction Register bits l4.7 which are 
gated to the Am2904 l9_3 inputs during the execution of a condi- 
tional branch. The output of the multiplexer, labeled TEST is 
routed to the test tree for input into the Am2910. The Am2904 also 
provides the shift linkages and shift linkage control and selection 
of the type of carry signal to the ALU and lookahead carry unit. 


The ALU ts designed to work with byte operations as well as 16-bit 
operations. Byte operations operate only on the lower 8 bits of 
register data without affecting the upper 8 bits of data. During byte 
operations the WORD signal (Mgg) goes inactive disabling the 
Write Enable and Output Y Enable for ALU bit slices 3 and 4. The 
word/byte multiplexer circuit will select C, N and OVR status bits 
from ALU bit slice 2 and at the same time ALU bit slice 2 has its 
MSS input pulled LOW to indicate most significant slice. The 
zero Status bit being OR tied to all of the ALU bit slices cannot be 
multiplexed. Instead the Y bus signals 8-15 are forced to zero by 
gating zeroes from the PCU resulting in the Z signal line state 
being a function of ALU bit slices 1 and 2 only. 


The Computer Control Unit 


The Computer Control Unit controls the sequence of execution of 
the microinstructions. The Am2910 Microprogram Controller pro- 
vides the sequencer for the microprogram (see logic diagrams 
Sheet 5). Branch addresses and counter values loaded into the 
Am2910 Do. inputs, originate from the Pipeline Register 
(Mo.14), the interrupt vector decoder, and the machine instruction 
decoder. The instruction decoder, also called Mapping ROM, (a 
512 x 8 PROM) uses the Instruction Register lg_;5 as address bits 
with the PROM outputs being the starting address of the micro- 
code sequence that executes each machine instruction. In this 
design the Am29775 Registered PROM’s are used to provide 
both the microprogram memory (512 x 96 bits wide) and the 
Pipeline Register. The microcode bits Mj¢.29 are output from 
Am29774 because these signals require open collector outputs 
rather than the standard tri-state outputs to allow the Am2910 
inputs Ip.3 to be pulled to zero. 


The starting address generation for the interrupt service routine 
and initialization routine is accomplished with a minimum of extra 
logic. During the last microcode cycle of the previous machine 
instruction, the MAPEN signal ts activated to enable the output of 
the Mapping ROM. However, if an interrupt request is pending, 
the Mapping ROM 1s disabled and the pull-up resistors force the 
eight least significant microprogram branch address lines to all 
ones, vectoring the microprogram to the interrupt service 
routine. After a reset, the microprogram should be vectored to 
address zero, the starting address of the initialization routine. 
This is accomplished by having the reset signal force zeroes into 
the Am2910 lIo.3 inputs which causes the Am2910 to output ad- 
dress zero. 


Clock and Memory Control 


The architecture of this computer achieves its high throughput by 
being able to execute machine instructions in as little as one 
microcycle. This is accomplished by overlapping (also called 
pipelining) the fetch and decode with the execute microcycles. An 
essential part of this design is the memory control section. The 
clock and memory control circuits shown in Sheet 6 of the logic 
diagrams work together to provide a very efficient mechanism for 
integrating memory operations with the computer. The memory 
interface timing 1s a clocked handshaked protocol shown in Fig- 
ure 8. Each memory transfer consists of a Bus Request, Bus 
Acknowledge response, Memory Request, Address Accept re- 
sponse, Data Request and a Data Sync response. At the 
maximum rate a memory interface response can occur 50ns after 
the computer activates a control line. This makes it possible to 
read from main memory once every microcycle (4 x 50ns = 
200ns); however should a particular memory board require a 
longer cycle, it can delay sending Data Sync to the computer to 
extend the cycle. 


The read and write timing are shown in more detail In Figures 9 
and 10. Note that if a memory read is taking place during microcy- 
cle N, the Bus Request, Bus Acknowledge and the start of 
memory address are output from the computer in the previous 
N-—1 cycle, and the data is sent to the computer during the first 
half of the following N+1 cycle. Now consider the case of 
back-to-back main memory read cycles. In this case, in the mi- 
crocycle that the computer sends the address to the memory 
board, the memory board is sending data to the computer; but this 
is not the data associated with the address being received but the 
data associated with the address received during the previous 
microcycle. 


A free running or uncontrolled 2OMHz clock on the backplane ts 
connected to all of the devices which effect memory transfers 
(CPU, bus controller, and memory modules). All of the signal 
handshaking that is required by the memory interface protocol is 
clocked with the same 20MHz clock to ensure no metastable 
conditions occur during memory transfer. Careful examination of 
this memory interface operation will reveal that not only does It 
solve the very serious metastable problem, but also that the clock 
synchronization and bus propagation delay occur during the 
memory read access time (or write time) and do not slow down 
the memory transfer rate. 


The CPU clock generation ts intimately related to the Memory 
Control Logic The CPU clock signals Phase 1 (¢,) and Phase 2 
(2) are shown along with the memory interface signals in Figure 
8. Phase 1 is a square wave set high at the beginning of the 
microcycle and has a period of 200ns. Almost all operations of the 
computer are clocked with the leading edge of ¢;. The clock 
control logic will enable the next cycle only if a Bus Request has 
received a Bus Acknowledge and only tf a Memory Request has 
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Figure 8. Clocked Handshaked Protocol. 


received a Data Sync response. If the bus or memory resources 
of the system are temporarily being used by other processors, the 
computer will stop the clock and wait. 


Data Path 


The Data Path logic incorporates 8-bit wide devices wherever 
possible. The D Register drives directly onto the external data 
bus. Both main memory and |/O data are received through the Z 
Registers. Registers Z, Z) and Z, are actually latches im- 
plemented with Am74S373’s. The Z Register enable latch signal, 
LDZ is derived from the memory control logic and main memory 
board logic both of which are clocked with the uncontrolled 
20MHz clock (2OMHZUNC). Using the uncontrolled clock allows 
the memory operation to go to completion at memory speed even 
when single stepping the microcode. This allows the system to 
use dynamic RAM'’s in the main memory since stopping the 
handshaking circuits during single step would prevent refresh 
Operations from taking place. 


Data from the main memory passes through the Z Register to the 
Zo and Z, Registers. The Zp and Z, Registers are enabled trans- 
parent at the beginning of the microcycle following the read main 
memory microcycle. This allows memory data to flow through the 
Z and Zp Registers (actually latches) to the ALU or flow through 
the Z and 2; Registers to the Instruction Decoder (Mapping 
ROM). The Z, and Zp Registers are locked down halfway through 
the microcycle guaranteeing the computer solid data and making 
it possible to send data from the D-Register out to the external 
Data Bus during the second half of the same microcycle. This is 
another example of how this design tightly dovetails data trans- 
fers in order to gain very high execution rates. 


Interrupt and Input/Output 


The interrupt and I/O section is shown in Sheet 7 of the logic 
diagrams. 


The basic interrupt handling is controlled by the Am2914. In this 
design the Am2914 is used to prioritize and enable interrupts, 
provide the mask register, generate an Interrupt Request and 
Interrupt Vector. Interrupt nesting is done in the machine software 
interrupt handler. The external interrupt request signals (INT o- 
INT7) are input into the Am2914 from the external Control Bus (C 
Bus). When a peripheral controller requests computer servicing, 
it activates its assigned interrupt line. If this interrupt level is 
unmasked and interrupts are enabled, the Am2914 activates the 
INTERRUPT REQ signal that goes to the Computer Control Unit 
which causes the microprogram to vector to the microcode inter- 
rupt service routine. This microcode routine pushes the PSW 
onto the main memory stack, then reads the interrupt vector from 
the Am2914 and uses this value to vector the computer to the 
machine software routine that services the interrupt. 


The Am9519 MOS Universal Interrupt Controller is incorporated 
into the design and its Group Interrupt signal is connected to the 
least significant INTo input of the Am2914. The Am9519 handies 
an additional eight interrupt levels for low speed requesting de- 
vices. This MOS LSI component offers the computer comprehen- 
sive interrupt handling capabilities at low cost. One feature the 
Am9519 offers is the capability of software generated interrupts. 
The console function, single instruction stepping, is implemented 
using a microcode routine that uses the software generated inter- 
rupt capability. 
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Figure 9. CPU Read Timing. 
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Figure 10. CPU Write Timing. 
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The 1/O protocol for the AMD 16-Bit Computer is similar to that 
required to control Am8080/9080 peripheral circuits. As shown in 
Figures 11 and 12, the computer outputs the address over the 
system address bus, activates a control line (e.g., IORD) and 
holds these outputs until receiving a response, IOACK, from the 
peripheral controller. Execution of the I/O operation ts done al- 
most entirely in microcode with the I/O Control Register, a single 
Am2920, being the only additional hardware required. This is an 
example of a design precept followed in this computer which Is to 
implement all features in microcode wherever possible. This re- 
sults in a low cost computer, although sometimes slower, and a 
design that ts flexible and easily modifiable to meet new 
requirements. 


The I/O section has two Am8251/9551 Programmable Communi- 
cation Interface components giving the computer two serial I/O 
Ports, one of which is reserved for the console. The console can 
be any standard RS-232 interface terminal 


Instruction Execution 


To execute instructions, the main steps performed by the com- 
puter are: (1) form memory address, (2) instruction fetch, (3) 
decode, (4) displacement fetch, (5) form operand address, (6) 
operand fetch, and (7) execute. Every instruction type is made up 
of microinstructions that execute these basic steps, but most 
instructions require three steps or less. Instruction sequences for 
Register to Register (RR) and Register to Indexed Storage (RX) 
instructions are shown in Figures 13 and 14 to illustrate how the 
computer operates. These figures show the RR instruction re- 
quiring four microcycles and the typical RX instruction requiring 
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Figure 11. 


IOACKFF (IN CPU) 


STROBE 


seven microcycles. However, as will be explained later, in actual 
operation the effective time for an RR instruction is one microcy- 
cle and three for the RX. 


Form Instruction Address 


During this microcycle the instruction address is formed by having 
the Program Control Unit (PCU) under control of the micropro- 
gram increment the Program Counter by two. This address ts 
then loaded into the MAR and back into the PC. 


At the beginning of the cycle, Bus Request !s activated causing 
the Bus Controller to respond with Bus Acknowledge. The ad- 
dress is then output from the MAR out on the Address Bus 50ns 
prior to the beginning of the next cycle. 


Instruction Fetch 


During this cycle, the main memory ts fetching the contents of the 
address previously generated. The computer is designed to work 
with high-speed main memory capable of reading a memory 
location in one microcycie so that the instruction will be sent back 
to the computer at the beginning of the next cycle. 


Decode Cycle 


The instruction fetched from main memory during the previous 
cycle is sent to the computer at the beginning of the cycle. The 
instruction falls through the Z and Z, Registers (actually trans- 
parent latches) and is routed to the Instruction Decoder (Mapping 
PROM). The Instruction Decoder translates the 8-bit operation 
code of the instruction into an 8-bit address used as the starting 
address for the microprogram that will execute this instruction. 
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Figure 12. I/O Write Timing. 
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Figure 13. RR Instruction Sequence. 
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Figure 14. RX Instruction Sequence. 
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Displacement Fetch Cycle 


After every instruction fetch another read cycle takes place. The 
second memory read will be another instruction fetch or an 
operand displacement fetch. The computer does not know what 
kind of a read out it ts until the instruction decode is finished. For 
an RX instruction, after the memory read is completed, the com- 
puter identifies it as a displacement. 


Form Operand Address Cycie 


The memory word ts sent from the main memory at the beginning 
of this cycle and then passes through the Z and Zp Register and 
goes to the ALU (Am2903’s). The ALU adds the displacement 
and the contents of the register specified by Xp field in the opcode 
and forms an operand address which ts then loaded into the MAR. 
This has to be completed 50ns before the end of the cycle. 


Operand Fetch Cycle 


The memory read cycle is performed and the operand Is sent to 
the computer at the beginning of the next cycie. 


Execute Cycles 


As the name implies, these are the microcycles that perform the 
task of the instruction but with the Am2903’s normally only one 
execute cycle is required; however, some instructions (e.g., I/O 
instructions) take as many as seven execute cycles. 


Simultaneously with the last execute cycle the Instruction De- 
coder is enabled. 


Pipelined Operations 


If the architecture of the computer executed each of the instruc- 
tions and each microstep sequentially, this computer would be 
just another computer relying on a high-speed clock to gain high 
throughput. However, the 16-Bit Computer becomes an excep- 
tional machine by using pipelining techniques. In this approach, 
the instruction steps for the following instructions are done during 
the decode and execute steps of the current instruction. The 
pipelining operation for a Register to Register class of instructions 
is shown in Figure 15. With the pipeline full, note that when 
instruction A is being executed, instruction B is being decoded, 
instruction C is being fetched from Main Memory and the MAR is 
being loaded with the address for instruction D. In the following 
cycle, RR instruction B is executed and RR instructions C, D and 
E proceed through the pipeline. The pipelining technique results 
in an RR instruction effectively being executed in one microcycle. 
As illustrated in Figure 16, anew RX instruction can be executed 
every three microcycles. 


Pipelining is great for throughput, but it is a bear to microcode 
especially the first time through since during any one cycle up to 
four instruction sequences have to be considered. It is not as bad 
as it first appears. Note that an instruction decode cannot take 
place until the last execute cycle of the current instruction. The 
major pipelining takes place during the first three steps: form 
memory address, instruction fetch, and decode. Execute and 
operand fetch steps allow full overlapped operation only during 
the last execute cycle. Instructions that require many execute 
microcycles (e.g., I/O instructions) cause the computer perfor- 
mance to drop down to nearly that of a non-pipelined machine. 


[ation 


= 
Neel 
bod 
ma | 
Address 
oe! 
a 
fo 
Beal 













Fetch Operand 








Pipeline Operation with Regard to Branching 
and Interrupts 


Pipeline operations greatly reduce instruction execution time if 
machine instructions are executed in sequential order; how- 
ever, if a branch is taken this advantage is lost because the 
steps set up in preparation for a decode cycle become use- 
less. The pipeline is said to be “flushed out” when a branch ts 
taken. The RX Branch on Condition instruction has the form: 


DISPLACEMENT 


Where: M is a 4-bit field specifying the conditions for the 
jump. 
(Xz) + displacement is the branch address 






WORD 1 


WORD 2 





Figure 17 shows the sequence chart for a RX Branch on 
Condition instruction. During the microcycle A, the target ad- 
dress K for the branch is formed and loaded into the MAR 
and also the instruction B is fetched for the no branch case. 
By microcycle A», it has been determined to take or not take 
the branch. If the branch is not taken, the MAR ts loaded with 
address B+2, while if the branch is taken, an instruction fetch 
is performed for K and the MAR is loaded with K+2. Finally in 
A3 the next instruction is decoded. By proper microcoding, the 
conditional branch is executed in only three microsteps even 
though the pipeline was “flushed out”. 


A, B, C, D are RR instructions 














Figure 15. Register-to-Register Pipeline Operation. 
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Figure 16. Register-to-Indexed Storage Pipeline Operation. 
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Figure 17. Branch on Condition RX Pipeline Operation. 


As with branching, an interrupt response alters the sequence 
of execution and “flushes” the pipeline. AS was discussed 
previously in the Interrupt and Input/Output section, an inter- 
rupt request blocks the decoding of the next machine instruc- 
tion and causes the Computer Control Unit to vector to the 
interrupt service routine. This microcode service routine 
pushes the PSW consisting of flags and Program Counter 
(PC) value onto the stack. The PC value is the current PC 
value minus 4. It is necessary to back the PC up to two in- 
struction words (4 bytes), because the fetch instruction and 
form instruction address steps in the pipeline at the time of 
the jump to the interrupt microcode sequence have to be re- 
peated when returning to the main machine program. 


MICROINSTRUCTION FORMAT 


All operations of the AMD 16-Bit Computer are under control 
of the microinstruction. Each microinstruction is 96 bits in 
length. The microinstruction format is summarized in Figure 
18. The microinstruction definition is summarized in Figures 
19a and 19b and is detailed in Table 2. 


Figure 20 illustrates the AMDASM® Definition file for the 16-Bit 
Computer. AMDASM® is a meta-assembler developed by AMD 


for writing microprogams. The definition file defines microword 
length (WORD statement), formats (DEF statements) and con- 
stants (EQU statements) for the use of the actual microprogram 
(Figure 31). 


The definition file is divided into 8 parts: 


Am2910 sequencer opcode definitions 

Am2903 ALU opcode definitions 

Am2901A PCU opcode definitions 

Am2904 shift mux and status control definitions 
Datapath control bits definitions 

Memory control bits definitions 

Control strobe and control bits definitions 
Immediate operand field definition 
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Am2910 Sequencer 


Bit 91 of the microword is the input of CCEN of the Am2910. When 
bit 91 is a logical 1, the conditional operations are forced to 
unconditional operations. Bits 19-16 are the input to the instruc- 
tion inputs to the Am2910. Bits 11-0 are the jump address field for 
instructions that need an address operand. 
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Figure 18. Summary of Microinstruction Word Fields. 











ROUTE TOB 


TRANSFER Z TO ZI 
Am2910 


Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 


Am2903 IEU WORD/BYTE 


ENABLE TRANSFER REG 
LOAD TRANSFER REG. 


I-REG EN CTR 
l-REG INC/DEC 


PCU TRANS CHIP DISABLE 
PCU TRANSFER REG. 
LOAD MEMORY ADDR. REG 


LOAD D-REG 


LOAD ZI INTO I REG. 


ENABLE ZO — DA 
ENABLE PSW 


SHIFT CNT Am2910 ADDR. 
BRANCH INSTR. EN 


Am2901 
Am2901 
Am2901 
Am2901 
Am2901 
Am2901 
Am2901 
Am2901 
Am2901 
Am2901 
Am2901 


BUS REQUEST 
MEMORY REQUEST 


F-B/Q 


HOLD REQUEST 


MEMORY WRITE/READ 
MEMORY WORD/BYTE 


ENTREG 


LDTREG 
ENCTR 
INC 
PCUCD 


PCU — Y 


LDMAR 
LDD 
ZI > | 
ENZO 
PSW 


SHTCNTEN 


BRIEN 


PCUly 
PCUI3 
PCUls 
PCUI, 
PCUly 
PCUAs 
PCUA, 
PCUAg 
PCUB, 
PCUB, 


MREQ 
HREQ 
WRITE 
MWORD 
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Figure 19a. Micro Control Word Bit Definitions. 
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EN IMMEDIATE > DA BUS 
ROM/IREGEN 

VO CONTROL REG. EN 
Am2914 INTERRUPTS DISABLE 
Am2914 ENlp-ENI3 

Am2904 SHIFT EN 


CrovSv9ovlyBp X X 


GENERAL 
USE 
CONTROL 
BITS 


GE9E ZESEBE OP LP CY 


Am2904 OUT EN CONDITIONAL TEST 
Am2904 EN ZERO 

Am2904 EN CARRY 

Am2904 EN SIGN 

Am2904 EN OVERFLOW 

Am2904 EN MACHINE STATUS 
Am2904 EN MICRO STATUS 

Am2904 |,2 CARRY OUT CNTL 
Am2904 |,, CARRY OUT CNTL 


92 LEBCHCOELECESEVE X X 


Am2904 
Am2904 
Am2904 
Am2904 & Am25LS251 
Am2904 & Am25LS251 
Am2904 & Am25LS251 


aad 
7 
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eo) 
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Figure 19a. Micro Control Word Bit Definitions (Cont.) 


Bits (35-42) 


ENTREG 


LDTREG 
ENCTR 
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ROM/IREGEN | VO Control Register | Am2914 I-l, | Am2904 Shift Enable 
Bit 46 Bit 44 Bit 43 





Figure 19b. Detailed Description of Bits 34 through 47. 


Table 2. Microinstruction Definition. 


Definition 


Routes second register field to B-RAM of Am2903. 
Loads the value in the Z register into the Z, Register at the beginning of the microcycle. 
Enables the CC input of the Am2910. 


These bits control the four Am2903’s. The function of EA, OEY, OEB, and lg_9 Is listed in 
Figure 20. WORD when enabled (LOW) causes the Am2903’s to operate on words (16-bits). 
When disabled (HIGH) the ALU operates on bytes (the least significant byte). This bit disabled 
blocks WE to the upper two Am2903’s and turns off their Y outputs. 


Zeroes should be forced to the upper 8 bits of the Y bus via the PCU to allow the zero status 

to operate correctly when the WORD bit is disabled. Also, when disabled the status (C, OVR, S) 
sent to the Am2904 is taken from the second Am2903 (numbering 0-3 least significant to 

most significant slice) instead of the most significant Am2903. 


Enable Transfer Register — enables the Transfer Register onto the DA input bus of the 
Am2901A’s and Am2903’s. 


Load Transfer Register — loads the Transfer Register from the Y bus. 


Enable | Register Counter — enables the | Register Counter (l7_;4) to count. This value Is 
used to address the general registers during stack instructions and by incrementing or 
decrementing this value the microprogram can read or write successive registers. 


| Register INC/DEC — the value in I7.,4 can be either incremented (if this bit is HIGH) or 
decremented. 


PCU Transceiver Disable — when HIGH this bit disables the PCU Transceivers from receiving 
or transmitting data. 


PCU Transceiver Control — when HIGH this bit allows the PCU Transceivers to pass data 
from PCU to the Y bus. { WORD high (microbit 90) disables the least significant 8 bits of 
these transceivers. | When LOW data passes from the Y bus to the MAR. 


Load Memory Address Register (MAR) — this bit loads the Memory Address Register. 
Load D Register — this bit loads the D Register with data from the Y bus. 


Load Z, into | Register — this bit loads data from Z, into the | Register. The | Register holds 
only the upper 16 bits of the instruction. 


Enable Z) — DA — this bit LOW enables the Zp Register onto the ALU DA. 
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Table 2. Microinstruction Definition. (Cont.) 


ee ee ee 


Enable PSW -— this bit LOW enables the PSW onto the ALU DA. 


SHTCNTEN Shift Count to Am2910 — this bit LOW enables the least significant four bits of the instruction 
(Ig.3) onto the D input to the Am2910 sequencer. This allows the value to be entered into the 
Am2910 internal counter to be used during shift instructions. 


Branch Instruction Enable — this bit LOW enables I4_7 of the Instruction Register onto the 
Am2904 lpo.3 input. The lp.3 inputs control the tests of the status register. 


These bits control the PCU which ts designed around four Am2901’s. The PCUI7, PCUIs, 
PCUl2, PCUI, and PCUlp bits connect directly to the Am2901 ly, Iz, lo, 11 and Ig respectively. 
The PCUAs-PCUAp, and PCUBs-PCUBsy connect to the A and B Address inputs of the Am2901. 
I4, I5, Ig, Ag and Bg are tied to ground. lg Is tied to Iz. 


Request Bus — this bit requests use of the system bus. This request is made the microcycle 
preceding a Memory Request or use of the bus for an I/O transfer. If the request is not honored, 
the processing of the next microinstruction Is halted until the acknowledge is issued. 


Memory Request — this bit requests the memory to do a read or write operation. 
Hold Request — this bit LOW blocks the bus controller from releasing the system bus to 


another device. Normally a Bus Request is cleared as soon as the Bus Acknowledge !s issued. 
HREQ holds Bus Request and prevents any other device from using the bus. 


Memory Write/READ — this bit indicates to the memory the MREQ 1s for a write operation 
(if HIGH) and a read operation (if LOW). 


Memory Word/BYTE — the Memory Word/BYTE microbit specifies whether the memory 
operation will be a word operation or a byte operation. If the operation specified is a byte 
operation the least significant address bit determines which byte of the two byte pair in memory 
Is affected. If the LSBit is a zero, the most significant byte is read or written, and the LSBit is a 
one, the least significant byte is read or written. 


EN Immediate DA Bus — this bit LOW enables the 16-bit immediate value (least significant 
16 bits of the microinstruction) to the ALU DA bus. 


ROM/I REG Enable — this bit enables either the ROM bits 42-35 or the | register bits Ip.7 onto 
the A/B address inputs of the ALU according to the following: 


ROM 42-39 ROM 38-35 


MPR-695 


IOEN I/O Control Register Enable — this bit loads the I/O Control Register with microbits 42-35. 
INTDIS Am2914 Interrupt Disable — this bit disables the Am2914 Interrupt Controller from recognizing 
interrupt requests. 


INTRIEN Am2914 ENlo-ENI3 — this bit is the instruction enable for the Am2914. The instruction inputs Ip_3 
are connected to microbits 35-38 respectively. 


SHFTEN Am2904 Shift Enable — this bit is connected to the shift enable of the Am2904. The shift 
controls Ig.49 are connected to microbits 35-39 respectively. 





Table 2. Microinstruction Definition. (Cont.) 


Ef tition 


This control field is used to provide several different functions as defined by the previously 
described control strobes (microbits 47-48). 


OUT EN CONDITIONAL TEST 
EN ZERO 

EN CARRY 

EN SIGN 

EN OVERFLOW 

EN MACRO STATUS 

EN MICRO STATUS 

CARRY OUT CONTROL 
CARRY OUT CONTROL 


These bits are used to control the Am2904. Their functions 
are defined in Figure 21. OECT 1s used to enable the test 
output of the Am2904 to the CC input of the Am2910. 


These bits determine which test is to be performed for the conditional branch and stack 
functions. The various tests are listed in Figure 25. The testing is done both in the Am2904 


and an 8 to 1 multiplexer. 


291013 
291012 
291011 
291010 


OFNWAAHON OO CO 


Am2903 ALU 


The first 16 equates assign mnemonics for the 18-15 of the 
Am2903 which controls the destination of the ALU result. The 
next 16 equates assign mnemonics for 14-11 of the Am2903 
which control the operations of the ALU. The ALU definition 
indicates the default is the Y bus forced to zero with no operation 
on destination. The next group of definition selects the source 
operand, followed by the special function definitions of the 
Am2903. 


Am2901A PCU 


The PCU definitions include a group of often used PC instructions 
such as PCU. NEXT, PCU. JUMP etc. The PCU definition itself 


These bits are connected to the I3.9 inputs of the Am2910 to control 
the sequencing of the microprogram. Their definitions are listed in Figure 26. 


These bits provide the branch address for the Am2910 and the 16-bit immediate field. 





allows a not predefined instruction be accessible to the micro- 
programmer. 


AM2904 Shift Linkage Multiplexer and Status Register 


The group of equates control the updating of the status register 
and the TEST definition controls the shift linkage multiplexer. The 
Carry control controls the carry into the least significant Am2903 
slice. 


Datapath Control 


The data control equates assign mnemonics to different datapath 
control bits. 
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AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
DEFINITION FILE FOR 16 BIT COMPUTER 


CG Seews: ” “Greece Se wewese 
N 
. 


CJS: 
JSB: 
JMAF: 
CJP: 
JMP: 
PUSH: 
PHLC: 
JSRPs 
CIV: 
JMPV: 
JRP: 
RECT: 
RPCT: 
CRIN: 
RIN: 
CJPF: 
LDCT: 
LOOP: 
CONT: 
TWB: 


MOO we Oe we we 


ADR: 
LDR: 
ADRQ: 
LDRQ: 
RPT: 
LDQP: 
QPT: 
RQFT: 
AUR: 
LUR: 
AURQ: 
LURQ: 
YBUS: 
LUQ: 
SINEX: 
REG: 


, 

HIGH: 
SUBR: 
SUBS: 
ADD: 
PASS: 
COMPLS : 
PASSR: 
COMPLR: 
LOW: 
NOTRS: 
EXNOR: 
EXOR: 
AND: 
NOR: 
NAND 


oe 


woe we ee pe we weer O 
o = 
a ee 


> 
w 


ADB: 
AQ: 
DAB: 
DADB: 


o 

>» 
eo 

ee 


oO 
w 
o 


Meee we ee we we wee ee OO we we we a ee we 
i) 
~< 
oe 


USMUL: 
TCMUL: 
INCTWO: 
SMTC: 
TCMLS: 
SLN: 
DLN: 
TCDIV: 


AMDASM DLFINTION FILE FOR 1€-BIT COMPUTER 
USING AM2901A, AM2S@3, AM2924 & AM2916 
FILE CREATED BY STEVE CHENG 8/25/78 


REVISION 2.9 


12/6/78 


WORD 96 


DEFINITIONS FOR AM2919 SEQUENCER 


DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEF 
DEI 


4X,B#0,71X,H#G,16X 
4X,B#O,71X,H#1,4X ,12V% 
4X,B41,71X,H#1,4X,12V% 
4X, 3#9,71X,H#2,15X 
4X,B#0,71X, H#Z,4X, 12V% 
4X ,BH1,71X,H#3,4X ,12V% 
4X, B#@,71X,H#4,4X,12V% 
4X,241,71X,HA4,4%,12V% 
4X,E#2,71X,H#5,4X,12V% 
4X ,B#O,71X ,H#6,16X 

4X, B#¥1,71X, HAG, 16% 
4X,3#0,71X ,247,4X,12V% 
4X, B#D,71X, HAS, 4X, 12V% 
4X,B#0,71X,H#G,4X,12V% 
4X ,B#d,71X,H#A,16X 

4X ,BA1,71X, HHA, 16X 

4X, b#O,71X,H#B, 4X, 12V% 
4X ,BHO,71X ,HAC,4X,12VUHHFFF 
4X,Ba0,71X,H#D, 16X 

4X, BHO, 71X, H#E,16X 

4X ,BH#0,71X,H#F ,4¥ ,12V% 


DEFINITIONS FOR AM2963 ALU 


3JUMP ZERO 

sCOND JSB PL 
;UNCONDITIONAL JSR PL 
; JUMP MAP 

;COND JUMP PL 
;UNCONDITCNAL JUMP PL 
;PUSH/COND LD CNTR 
;PUSH AND LD CNTR 
CONT JSn R/PL 

s5CONTD JUMP VECTOR 
sUNCONDTICNAL JUMP VECTOR 
;CONET JUMP R/PL 
sREPEAT LOCK, CNTR <> @ 
;REPEAT PL, CNTR <> ¥ 
;COND RIN 
sUNCONDTIONAL RETURN 
7COND JUMP PL & POP 
3LD CNTR & CONT 

+TEST END LOOP 
+CONTINUE 

;THRELE-WAY BRANCH 


THE ALU DEFINTION IS OF THE FOLLOWING FORMAT 


ALU 


EQUATES FOR ALU 


EQU 
EQU 
EQU 
IQu 
EQU 
EQU 
FQU 
EQU 
EQU 
EQU 
EQU 
BQU 
EQU 
EQU 
EQU 
EQU 


DESTINATION 


H#O sARITHMETIC SHIFT DOWN, RESULTS INTC RAM 
H#1 ;LOGICAL SHIFT DOWN, RESULTS IATO RAM 

H#2 sARITH. SHIFT DOWN, RESULTS INTO RAM AND Q 
BARS sLOGICAL SHIFT DOWN, RESULTS INTO RAM ANL Q 
Ha4 *RESULTS INTO RAM, GENERATE PARITY 

H#S sLOGICAL SHIFT DOWN Q, GENERATE PARITY 

h# 6 #RESULTS INTO Q, $SHNERATE PARITY 

He? sRESULTS INTO RAM AND C, GENERATE PARITY 
HHS sARITH. SHIFT UP, RESULTS INTO RAM 

he#9 sLOGICAL SHIFT UP, RESULTS INTO RAM 

HHA sARITH. SHIFT UP, RESULTS INTO RAM AND Q 
H#B sLOGICAL SHIFT UP, RESULTS INTO RAM AND Q 
h#C +RESULTS TO Y BUS ONLY 

HAD sLOGICAL SHIFT UP Q 

H#E +SIGN EXTEND 

ha ¥ ;RESULTS TO RAM, SIGN EXTEND 


EQUATES FOR ALJ FUCNTIONS 


CONTROL, FUNCTIOGN 
DESTINATION CONTROL 


EQU HA sFI = 1 

EQU Hel sSUBTRACT R FROM S 

EQU H#2 ;SUBTRACT S FROM R 

FQU HAS ,ADD R AND S 

EQU H#4 ;PASS § 

FQU H#5 32°S COMPLEMENT OF S$ 
EQU H#¥6 ;PASS R 

EQU H¥7 32°S COMPLEMENT OF R 
EQU He >FI = @ 

EQU Hes s>COMPLEMENT R AND WITE S 
YQU HHA s;EXCLUSIVE NOR R WITH S 
EQU H#RB sEXCLUSIVE OR R WITH S 
EQU HAC sAND R WITH S 

EQU H#D +NOR R WITH S 

EQU H#E SNAND Kk WITH S 

EQU H#F 3;OR R WITH S 


ALU DEFINTION 
DEF OX ,4VH#C ,4VHH8, 79X 
ALU OPERAND SOURCES 


DEF 6X, 2#@,1X,B#0,6X,BH®, 7BX ;R = RAM A, S = RAM B 
DEF 6X,B4@,1X, BH1,8X , BAB, 78X 3R = RAM A, S = DB 
DEF 6X,B#@,10X,B#1,78X 3R = RAM A, S = Q 

DEF 6X,B#1,1X,B#@,8X , BHO, 7AX 3R = DA, S = RAM B 
DEF 6%..B#1,1X,B#1,8X , BAG, 78X sR = DA, S = DB 

DEF 6Y,3B#1,10X, B#1,78X ;R = DA, S$ = Q 


WORD/BYTEL CONTROL 

DEF 5X ,B#®8,90X 

OUTPUT Y ENABLE 

DEF 7X, B#S,88X 

SPECIAL FUNCTIONS FOR AM29@3 


TO USE THE SPECIAL FUNCTIONS, THE DESTINATION 
CONTROL MUST NOT BE AQ OR DAQ 


SPECIAL FUNCTION EQUATES 


EQU H#86 sUNSIGNED MULTIPLY 

EQU H#20 3TWO°S COMPLEMENT MULTIPLY 

EQU H#4a ; INCREMENT BY ONE OR TWO 

EQU H#5@ > SIGN-MAGNITUDE/TWO°S COMPLEMENT 

EQU H#6@ 3TWC°S COMPLEMENT MULT. LAST STEP 

EQU H#8@ sSINGLE LENGTH NORMALIZE 

EQU H#AG >DOUBLE LENGTH NORMALIZE AND 1ST DIVIDE OP. 
EQU Hace sTWO°S COMPLEMENT DIVIDE 


TCDC: 


SPF14: 


ts 
a 


foo} 
ow 


ewe we CL) we ws Se) we we we we WP we we we we We we we w 


PCUAQ: 
PCUAB: 
PCUZQ: 
PCUZE: 
PCUZA: 
PCUDA: 
PCUDQ: 
PCUDZ: 
; 

; 

; 

A®: 
Al; 
A2: 
AS: 
A4: 
AS: 
AG: 
A?: 


OF oe 08 ce 00 as 00 of 


see ee te td td to tn tol te tr we we we 
NOORGN HE & 


acd 
Q 
a 


, 
PCU.NEXT 


PCU.PUSH 
PCU.POP: 


PCU.JUMP: 


PCU.TR2: 
PCU.NOP: 
PCU.SP: 

PCU.DEC4 


MOO we we we Oe we we we we we Oe we WO WO we OO we 


SHIFTEN: 
OECT: 
EZ: 

EC: 

ES: 
EQVR: 
CEM: 
CEU: 


EQU H#EG ;TWwO’S COMPLEMENT DIVISION CORRECTION 


SPECIAL FUNCTION DEFINITION 


DEF SX, 8VH#, 79X 
DEFINITION FOR AM29@1 PROGRAM CONTROL UNIT (PCU) 
PCU REGISTER DEFINITIONS: 

PC 


R@ = PROGRAM COUNTER 

Ri = SP STACK POINTER 

Re = SPLL STACK POINTER LOWER LIMIT 
R3 = SPUL STACK POINTER UPPER LIMIT 
R4 = 2 CONSTANT @ 

R5 = 4 CONSTANT 2 

EQUATES FOR PCU DEFINITIONS 


EQU Bas 
EQUATE FOR PCU FUNCTIONS 


9Q REG = ZERO, B-RAM = ONE DEFAULT 


EQU Bri 9SUB = ONE, ADD = ZFRO DEFAULT 


EQUATES FOR SOURCE CONTROL 


EQU Q#@ 
EQU Q#1 
EQU Q#2 
EQU Q#3 
EQU Q#4 
EQU Q#5 
EQU Q#6 
EQU Q#7 


EQUATES FOR PCU A-RAM 


EQU Q#8 
EQU Q#1 
EQU Q#2 
EQU Q#e 
EQU Q#4 
EQU Q#5 
EQU Q#6 
EQU Q#7 
EQUATES FOR PCU B-RAM 
EQU Q#@ 
EQU Q#1 
EQU Q#2 
EQU Q#3 
EQU QH#4 
EQU Q#5 
EQU QHE 
EQU Q#7 


PCU DEFINITION 
DEF 31X,1VB#1,1 VEO, 3VQH1, SVQ4, SVQH, 54X 


H DEF 31X, B¥10@61160068 ,54X #PC = PC + 2 

: DEF 31X,B#11001100001 ,54X ;S5P = SP ~ 2 
DEF 31X,B#106@1196901 ,54% ,oP = SP + 2 
DEF 31X , F#19111988000 ,54% iPC = D 
DEF 31X ,B#16191108060 ,54X ;PC = TPEG + 2 
DEF 31X, BH19911980088 ,54X ;PC TO OUTPUT 
DEF 31X,B#10011901001 ,54X 3SP TO OUTPUT? 

3 LEF 31X ,2W1103119190008 ,54X 9PC = PC - 4 


DEFINTIONS FOR AM29@4 RELATED CONTROL BITS 


AM2 
BIT 
BIT 
BIT 
BIT 
BIT 
BIT 
BIT 
BIT 
BIT 
BIT 
BIT 
BIT 


964 BIT DEFINITIONS ARE AS FOLLOWS: 

S$ 95-44 = DON’T CARES 

43 = SHIFT ENABLE 

S 42-35 = GENERAL PURPOSE CONTROL BITS 


34 = OUT EN CONDITIONAL TEST 
33 = ENABLE ZERO 

32 = ENABLE CARRY 

31 = ENABLE SIGN 

3@ = ENABLE OVERFLOW 

29 = ENABLE MACHINE STATUS 
28 = ENABLE MICRO STATUS 


S 27-26 = CARRY OUT CONROL 
S 25-2@= CONDTIONAL BRANCH TEST 
hQU BHO 3SHIFT ENABLE 
EQU Bee s;OUT EN CONDITIONAL TEST 
EQU BHO ;ENABLE ZERO 
EQU BHe jENABLE CARRY 
EQU Brg ; ENABLE SIGN 
EQU Bae yENAELE OVERFLOW 
EQU BHd ;ENABLE MACHINE STATUS 
EQU BAe ;ENABLE MICRO STATUS 


’ 
AM29@4: DEF 52X,1VBH1,6X,1VEH1,1VB#41 ,1VBH1,1VB41,1VBH1 ,LVBH1,1VBH1,28X 


EST: 


We we we py we we we 


COEQS: 
COEQ1: 


COEQCI: EQU BH12 
COEQST: EQU B#11 


we we ee Cw we ww 


TES 


T BITS DEFINTION 


DEF 76X, 6VQ#, cox 
EQUATES FOR AM2964 CARRY-OUT CONTROL 
EQU BHOd +CARRY-OUT = @ 
EQU BH#O1 ;CARRY-OUT = 1 
sCARRY-OUT = CARRY~IN 
;CARRY-OUT = CARRY OF STATUS REGISTER 


CARRY-OUT CONTROL DEFINITION 


ARRYCTL: 


REG 


DEF 68X,2V 3B#8@,26X 
ISTER MUX SELECT 


Figure 20. Definition File for 16-Bit Computer. 


AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
DEFINITION FILE FOR 16 BIT COMPUTER 


DEF Bed, 95X sROUTE R1 TO RAM B 
; 


; EQUATES FOR DATAPATH DEFINITION 


, 

ZZI: EQU Bel 3Z REG TO ZI REG 

ENTREG: EQU Ba +ENABLE TRANSFER REGISTER 
LDTREG: EQU B#1 +LOAD TRANSFER REGISTER 
ENCTR: EQU BHO +I-REG EN CTR 

INC: EQU Bel 3I-REG INC/DEC* 

PCUY: EQU BHel +PCU TRANSCEIVER TO Y-BUS 
YMAR: EQU BROS *PCU TRANSCEIVER TO MAR BUS 
PCUMAR: EQU B#i1 >PCU TRANSCEIVER CEIP DISABLE 
LDMAR: EQU B#i +LOAD MAR 

LDD: EQU Bel +LOAD D REG 

ZIT: EQU Bl sLOAD ZI INTO I-REG 

ENZ8: EQU BHO s ENABLE 28 TO DA 

PSW: EQU B5#@ j;ENABLE PS 

SHTCNTEN:EQU BHO sSHIFT CNT 2916 ADDR 

BRIEN: EQU B#@ ;BRANCH INSTRUCTION ENABLE 


; DATAPATH DEFINITION 


; 
DATAPATH: DEF 3X,1VB#0,14X,1VB#1,1VB#0,1VBH1,10B#2,2VBH11,1VB#B, 


1VB#8,1VBHO,1VBH1,1VE#H1,1VP#1,1VBH1,55X 
EQUATES FOR MEMORY CONTROL 
’ 
REQB: EQU BH1 9BUS REQUEST 
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MREQ: EQU Bal ;MEMORY REQUEST 
HREQ: EQU Bee sHOLD REQUEST 
WRITE: EQU B#l >MEMORY WRITE 
MWORD: EQU B#1 ;MEMORY WORD/BYTE* 


; DEFINTION FOR MEMORY CONTROL 

MEM .CONT :DEF 42X,1VBH@,1VB#O,1VBH1 ,LVBH2 ,1VBHD,49X 
EQUATES FOR CONTROL STROBES 

ROM: — EQU B#1 ;ROM/IKEGEN* 

IOEN: EQU Bao 31/0 CONTROL REG. ENABLE 


INTDIS: EQU BHO ; INTERRUPT DISABLE 
INTRIENSEQU BRO ;ENABLE I®-I3 ON AM2914 


; CONTROL STROBE DEFINTION 

CONTROL: DEF 48X,1VBA0,1VBH1,1VBH1,1VBH1,44x 
CONROL BITS DEFINITION 
DEF 53X,8VH#,35X 
IMMEDIATE ROM DEFINITION 
DLF 47X,B#®@,32X,16VH# sENABLE IMMEDIATE OPERAND 
END 


TOTAL PHASE 1 ERRORS = 





Figure 20. Definition File for 16-Bit Computer (Cont.). 


Memory Control 


The memory control equates assign mnemonics to different 
memory control bits. 


Control Strobe and Control Bits 


The control strobe equates assign mnemonics to the control bit 
strobe signals. The control bit definition defines a hexadecimal bit 
pattern for the 8 control bits. 


Immediate Operand 


When the Am2910 sequencer is executing an instruction which 
does not require an address operand, bits 15-0 in the microword 
can be used as a 16-bit Constant to load ALU, PCU etc. This is 
accomplished by putting the constant in bits 15-0 and force bit 48 
to logic 0. 


MICROCODE 


Flowcharts 


The flowcharts of the major instruction types are shown in the 
following figures. 


Figure 21 illustrates the basic microprogram flowchart and dem- 
onstrates how the pipelining is done in microcode. This figure 
illustrates the sequencing of the computer starting with no in- 
structions in the pipeline. By the fourth microinstruction, the 
pipeline is full and the CPU can execute for example a mac- 
roinstruction every microcycie. 


Figure 22 illustrates the execution of an RR instruction. During an 
RR instruction, PC +6 is loaded into the MAR and a bus requestis 
issured for the content of PC+6. The contents of PC+4 are read 
into the Z register. The Z, and | Registers are loaded with the 
contents of PC+ 2. The instruction at PC is executed. The input to 
the mapping PROM is loaded with the contents of PC +2. Thus in 
a stream of RR instructions, four instructions are in progress 
concurrently. 


Figure 23 illustrates the execution of an RX instruction. In this 
figure the decode operation takes the microprogram to the mi- 
crostep where the form address operation is done. Since the 
decode of the instruction has been completed in the previous 
step, the form address microinstructions are unique to each RX 
instruction in spite of the fact the operation performed is identical. 





LOADS PC INTO MAR 
REQUESTS BUS FOR INST N 


LOADS PC+2 INTO MAR AND PC 


REQB, MREQ 
START1 REQUESTS BUS FOR INST N+1 
PC+2—e MAR, PC READS INST N 


LOADS PC+2 INTO MAR AND PC 
REQB, MREQ REQUESTS BUS FOR INST N+2 


PC+2 —— MAR, PC READS INST N+1 
SIARTs Z-—Z,, 2,1 LOADS Z, REG WITH INST N 
DECODE LOADS | REG WITH INST N; 
DECODES INST N. 


EXECUTE 
INTERRUPT 
ROUTINE 


EXECUTE 
INSTRUCTION 


Figure 21. Microprogram Start Up Flow Chart. 


From the form address step, the microprogram jumps to 
FETCHOP where the operand is fetched. This step returns to 
where the instruction is actually executed. 


Figure 24 illustrates the execution of an RSI instruction. At the first 
microstep, the immediate operand is already in the Zp register. So 
the instruction is executed in the first step. The microprogram is 
then jumped to START2 to refill the pipeline. 
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FROM INST N—1 


REQB, MREQ 
PC+2—= MAR, PC 
Zz ~~Z, ; 21 —~ | 
EXECUTE 
DECODE 


REQUESTS BUS FOR INST N+3 
READS INST N+2 

PUTS PC+2 INTO PC AND MAR 
LOADS Z, AND | WITH INST N+1 
EXECUTES INST N 





DECODES INST N+1 


TO 
INTERRUPT 
HANDLER 


TO 
INST N+1 


RR INSTRUCTIONS IMPLEMENTED 


OPCODE R1, R2 
Load Register R1 = (R2) 
Add Register R1 = (R1) + (R2), Set CC 
Subtract Register R1 = (R1) — (R2), Set CC 
AND Register (R1) AND (R2), Set CC 
OR Register R1 = (R1) OR (R2), Set CC 
Compare Logical Register Set CC with (R1) — (R2) 


Exclusive OR Register R1 = (R1) XOR (R2), Set CC 
MPR-697 





Figure 22. RR Instruction Flow Chart. 


REQB, MREQ FORMS OPERAND ADDRESS BY ADDING Z)+X2 
Zo+X>—— MAR REQUESTS BUS FOR OPERAND 
JSB FETCHOP READS INST N+1 


REQB, MREQ LOADS PC+2 INTO PC, MAR 
PC +2 —~— MAR, PC REQUESTS BUS FOR INST N+2 
READS OPERAND 
LOADS INST N+1 INTO Z, 


FETCHOP 


LOADS PC+2 INTO PC, MAR 
Se REQUESTS BUS FOR INST N+3 
PC+2-= MAR, PC 
eat || Gero 
LOADS Z, WITH N+1 
aes DECODES INST N+1 
EXECUTES INST N 


TO 
INTERRUPT 
HANDLER 


TO 
INST N+1 


RX INSTRUCTIONS IMPLEMENTED 


OPCODE A, X2 (DISP) 

LD R1, X2 (D) R1 = (X2) + D 

ST R1, X2 (D) (X2) + D = (R1) 

ADD R1, X2 (D) R1 = (R1) + [(X2) + D], Set CC 
SUB R1, X2 (D) R1 = (R1) — [(X2) + D], Set CC 

N R1, X2 (D) R1 — (R1) AND [(X2) + D], Set CC 
O R1, X2 (D) R1 = (R1) OR [(X2) + D], Set CC 


CMP R1, X2 (D) Set CC FOR (R1) — [(X2) + D] ene 
-698 





Figure 23. RX Type Instruction. 


REQB, MREQ 
PC+2 —— MAR, PC 
EXECUTE 
JMP START2 


REQB, MREQ 
PC+2-—= MAR, PC 
Z-—Z,,2,;—~1 
DECODE 


START2 


TO 
INTERRUPT 
HANDLER 


TO 
INST N+1 


LOADS PC+2 INTO MAR, PC 

REQUESTS BUS FOR INST N+2 

READS INST N+1 

EXECUTES INST N WITH IMMEDIATE OPERAND IN Z) 


JUMPS TO START2 


LOAD PC+2 INTO MAR, PC 
REQUESTS BUS FOR INST N+3 
READS INST N+2 

LOADS INST N+1 INTO Z, 
LOADS IREG 

DECODES FOR INST N+1 


IMMEDIATE INSTRUCTIONS IMPLEMENTED 


OP CODE R1, DATA 

LI Load Immediate 
NI AND Immediate 
Ol OR Immediate 


xl Exclusive or Immediate 


Al Add Immediate 
Sl Subtract Immediate 
Cl Compare Immediate 


R1 = DATA 
R1 = (R1) AND DATA, Set CC 
R1 = (R1) OR DATA, Set CC 
R1 = (R1) XOR DATA, Set CC 
R1 = (R1) + DATA, Set CC 
R1 = (R1) — DATA, Set CC 
Set CC with (R1) — DATA 





Figure 24. Immediate Instructions. 


Figure 25 illustrates the execution of an unconditional branch 
Instruction. At the first microstep the displacement is already in 
the Zp register. The branch address is formed by adding the 
contents of the Zp register to the contents of the index register X;. 
The MAR ts loaded with the branch address and a bus request Is 
issued for the contents of the branch address. The branch ad- 
dress is also loaded into the transfer register for subsequent 
loading of PC. In the next step, the contents of the transfer 
register+2 Is loaded into the PC and MAR. A bus request is 
issued to BA+2. The content of BA is read. The microprogram is 
then transferred to START2 to fill up the pipeline. 


Figure 26 illustrates the Conditional Branch instruction. In step 1, 
unlike the Unconditional Branch instruction, the contents of the 
memory (instruction N+1) is read, in case the test condition fails 
and the macro program falls through. The condition test 1s en- 
abled in this step. Ifthe test passes, the microprogram transfers to 
Unconditional Branch routine. If the test fails, the microprogram 
proceeds to fill the pipeline and continue. 


Figure 27 illustrates the branch and link instruction. The flowchart 
is similar to Unconditional Branch except an extra step (STEP 2) 
Is inserted. This step saves PC tn R;. 


Figure 28 illustrates a shift or rotate instruction. In STEP 1 the 
opcode of the next instruction is loaded into Z, registers and the 
shift count of the shift instruction is loaded into the loop counter of 
Am2910. STEP 2 executes the shift instruction N+ 1 times, where 
N ts the shift count in the instruction. It should be noted that since 
Am2910 detects — 1 as the stop condition, the shift count loaded 
should be one less than the desired count. Step 3 is the same as 
the RNI (request next instruction). Itis duplicated because the fail 
condition of RPCT in Am2910 can only fall through. 


INST N—1 


REQB LOADS MAR AND TREG WITH 
29+X2 =~ MAR BRANCH ADDRESS 
Zot Xo —= TREG REQUESTS BUS FOR INST AT BA 


REQUESTS BUS FOR INST BA+1 
READS INST BA 

LOADS MAR AND PC WITH 
ADDRESS OF INST BA+1 


REQB, MREQ 
TREG+2 -~ MAR 
TREG+2 -~ PC 


REQUESTS BUS FOR INST BA+2 
READS INST BA+1 

LOADS INST BA INTO Z, AND! REG 
DECODES INST BA 


REQB, MREQ 
PC+2—— MAR, PC 
Zz —~Z:, 2; | 
DECODE 


START2 


BA = BRANCH ADDRESS 
TO 
INTERRUPT 
HANDLER 


BX X,, (D) 


BRANCH PC = (X,)+D 


Tay 


Figure 25. Unconditional Branch. 
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REQB, MREQ 
Zo+Xo —> MAR 
Zo+Xo ~~ TREG 

BRIEN 


BRANCH 
CONDITION? 


REQB 
PC+2 -~PC, MAR 
Y ono Z1 


REQB, MREQ 
PC+2—— MAR, PC 
Z,-~!1 


1 
DECODE 


TO 
INTERRUPT 
HANDLER 


REQUESTS BUS FOR BRANCH ADDR 
READS INST N+1 
LOADS MAR AND TREG WITH BRANCH ADDR 


REQUESTS BUS FOR INST N+2 
LOADS PC AND MAR WITH PC+2 
LOADS Z, WITH INST N+1 


REQUESTS BUS FOR INST N+3 
READS INST N+2 

LOADS PC, MAR WITH PC+2 
LOADS Z, WITH INST N+1 
DECODES FOR INST N+1 


TO 
INTERRUPT 
HANDLER 


START2 


REQB, MREQ 
TREG+2 —~ MAR, PC 


REQB, MREQ 
PC+2-= MAR, PC 
Z—~—Z,,Z.-——| 
DECODE 


REQUESTS BUS FOR BA+1 
READS INST AT BA 
LOADS PC, MAR WITH TREG+2 


REQUESTS BUS FOR BA+2 
READS INST AT BA+1 

LOADS PC, MAR WITH PC +2 
LOADS Z,, IREG WITH INST AT BA 
DECODES FOR INST AT BA 


TO 
INST N+1 


BC CC,X,; (D) 


TO INST AT BRANCH ADDR 





MPR-701 


Figure 26. Conditional Branch. 


Figure 29 illustrates the input instruction. In STEP 1, the I/O Port 
Address is formed by adding Zp) and Xp. Bus request is issued for 
the I/O Port. The desired width of the I/O read pulse is loaded into 
the Am2910 Loop Counter. The width of the I/O read pulse is 
(N+2) X cycle time where N is the number loaded. The I/O read 
signal is turned on. In STEP 2, the bus is held for the I/O address 
and the loop counter is decremented until it becomes —1. In 
STEP 3, I/O read pulse is turned off but I/O address is held for 
possible address hold time requirement of the I/O device. On the 
trailing edge of the I/O read pulse, the content of the I/O Port is 
strobed into the Zp register. In STEP 4, the content of Zp) register 
is loaded into R,, thus completing the I/O read. Bus request is 
issued for the next instruction and microprogram jumps to 
START 1 to refill the pipeline. 


Figure 30 illustrates the output instruction. In STEP 1, bus request 
is ussued for the I/O Port Address. In STEP 1, the content of R, is 
transferred to the D register for outputting to the data bus. The I/O 
write pulse is set and the width of the write pulse is loaded into the 
Am2910 Loop Counter as in the input instruction. In STEP 3, the 
/O address s held until loop counter becomes —1.in STEP 4, the 
content of the D register is strobed into the I/O Port by turning off 
the I/O Write Pulse. The microprogram jumps to START to refill 
the pipeline. 


The Figures 21-30 illustrate the major instruction types im- 
plemented. These are by no means the only possible instructions 
for the 16-bit computer described. Some other instructions such 
as stack instructions are shown in the microcode but not in the 
figures and should be easily understood with the above examples 
as a guide. 


Figure 31 illustrates the implementation of some typical instruc- 
tions. Instruction O is the restart instruction. It jumps to INIT 
which is located in location H#180 because the mapping PROM 
maps only into the first 256 locations. So it is desirable to pre- 
serve these locations for Macro instructions. The initialization 
routine does the following: 


. Turn on I/O reset signal and jump (Inst H#0) 

. Set Ro in ALU to 0 (Inst H#180) 

. Set Ro in PCU (PC) to 0 (Inst H#181) 

. Set R; in PCU (SP) to H#4000 (Inst H#182) 
. Set Ry in PCU to 2 (Inst H#183) 

. Set Rs in PCU to 4 (Inst H#184) 

. Turn off I/O reset signal (Inst H#185) 

. Initialize console USART (Inst H#186-H#190) 


ONOaAA WN — 


The microinstruction that executes macroinstructions are 
grouped as follows: 


Type Figure Microinst # 
(Hex) 
RR Instructions 22 005-00B 
RX Instructions 23 00C-01B 
RSI Instructions 24 01C-022 
Branch Instructions 25-27 023-02A 
Shift Instructions 28 02B-042 
Input Instruction 29 043-046 
Output Instruction 30 047-04A 
Stack Instructions < 04B-059 
Interrupt Instructions = 05A-061 


INST N—1 


Zo+Xo —~> MAR 
Zp+Xo-= TREG 


REQGB, MREQ 
TREG+2—-— PC 
TREG+2 -— MAR 


REQB, MREQ 
PC+2— MAR, PC 
Y Local 21; 2; —- | 
DECODE 


START2 


TO 
INTERRUPT 


HANDLER TO 


INST N+1 
BAL Rj, X> (D) 


LOADS MAR AND TREG WITH 
BRANCH ADDRESS 


SAVES PC IN Ry 
REQUESTS BUS FOR INST N+1 


REQUESTS BUS FOR INST N+2 
READS INST N+1 
LOADS TREG+2 INTO PC, MAR 


REQUESTS BUS FOR INST N+3 
READS INST N+2 

LOADS PC+2 INTO PC, MAR 

LOADS INST N+1 INTO Z,; AND IREG 
DECODES FOR INST N+1 


R, = (PC), PC = (X2)+D 
MPR-702 


Figure 27. Branch and Link. 


FROM 
INST N—1 















zZ—~ Z1 
N-—= SHIFT CNT 


EXECUTE SHIFT 
OR ROTATE 
CNT —1—-= CNT 


INST N 





REQB, MREQ 
PC+2 —=— MAR, PC 
2; —~ | 
DECODE 


TO 
INTERRUPT 


HANDLER 
TO 


INST N+1 


LOADS INST N+1 INTO Z 
LOADS SHIFT COUNT OF INST N 
INTO LOOP COUNTER 


EXECUTE 1 CYCLE OF SHIFT 
OR ROTATE DECREMENT 
LOOP COUNTER 


IF LOOP COUNTER = —1, 
GO TO NEXT INST 
OTHERWISE, REPEAT CURRENT INST 


REQUESTS BUS FOR INST N+3 
READS INST N+2 

LOADS Z, WITH INST N+1 
DECODES INST N+1 


MPR-703 


Figure 28. Shift and Rotate Instructions. 


INST N—1 


REQB, HREQ 
Zo+X -= MAR 
IOCNT -— CTR 

SET IOR 


Y 
HREQ 
RESET !OR 


REQB 
Zo Ry 
JMP START1 


TO 
START1 
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REQUESTS BUS FOR INPUT PORT 
HOLDS THE BUS 

LOADS MAR WITH I/O ADDRESS 
LOADS 1/0 COUNT INTO 

LOOP COUNTER 

TURNS ON 1/0 READ 


HOLDS BUS 
DEC, LOOP COUNTER 
UNTIL CTR = —1 


HOLDS BUS 
TURNS OFF 1/0 READ 
READS I/O PORT CONTENT 


REQUESTS BUS FOR INST N+1 
LOADS I/O PORT CONTENT INTO R1 
RE-FILLS PIPELINE FROM “START1” 


MPR-704 


Figure 29. Input Instruction. 


REQB, HREQ 
Xo+Zo —» MAR 


SET lOW, HREQ 
R,—™ DREG 
\OCNT-= CTR 


HREQ 
CTR—1-—~CTR 


RESET IOW, HREQ 
JMP START 


TO START 





LOADS MAR WITH 

OUTPUT PORT NUMBER 
REQUESTS BUS FOR OUTPUT 
HOLDS THE BUS 


TURNS ON 1/0 WRITE 

HOLDS THE BUS 

LOADS DREQ WITH R1 

LOADS 1/O CNT INTO LOOP COUNTER 


HOLDS THE BUS FOR I/O ADDRESS 
DEC LOOP COUNTER 
UNTIL CTR = —1 


WRITES THE CONTENT OF DREQ 
INTO OUTPUT PORT 

HOLDS THE BUS 

JUMPS TO “START” 

TO REFILL PIPELINE 


MPR-705 


Figure 30. Output Instruction. 
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Upon an interrupt, the 16-Bit Computer finishes its current in- 
struction and jumps to microinstruction H#1FF. The interrupt 
handler works as follows: 


1. Current PSW is stored in DREG and SP 
H#1FF). 


SP-—2 (Inst 


2. The content of PSW is written onto the stack in memory. PC = 
PC—4 to flush out the pipeline (Inst H#1FO). 

3. SP = SP—2 (Inst H#1F1). 

4. The content of the adjusted PC is written to the DREG (Inst 


H#1F2). 


5. The content of the PC is written onto the stack in memory and 
the vector in the Am2914 is output to the interrupt vector 
PROM. A vector jump ts made following this instruction de- 
pending on the interrupt number (inst H#1F3). 


6. The vector jump directs to 1 of 8 locations labelled INTg-INT7. 
For INT,-INT7, the first instruction disables interrupt in the 
Am2914 and forces new PC value into PC. INT) requires an 
extra instruction to clear the Am9519. The interrupt vector in 
the Am9519 is to be determined by the macro interrupt 
handler. 

This next instruction is the same as the START instruction. 
The previous instruction cannot jump to START directly be- 
cause the immediate operand uses the jump address field. 
The macroprogram resumes at the new PC value. 


The instructions implemented cover only a small portion of all 
possible instructions. Only 137 or 512 microinstructions are used. 
The rest of the instruction space could be used to vastly enhance 
the instruction set such as byte operations, storage to storage 
instructions, etc. 
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TART: 


TART1: 


TART2: 
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MICROPROGRAM FOR AMD 16-BIT COMPUTER 
WRITTEN BY STEVE CHENG 9/78 
REVISION 1.1 12/15/78 


HE 2 Me ae Me MR a a A eae eae a ae ae ae abe Rae ae ae ae a ae ae ae a ae ae aK 


RESET SEQUENCE STARTS HERE 


ALU & WORD & CONTROL ,IOEN,INTDIS, & CNTLB H#7F & DATAPATH & 
MEM.CONT REQB,,HREQ,,MWORD & AM2984 & PCU.NOP & JMP INIT 


REQUEST BUS FOR INSTRUCTION N 


ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 
DATAPATH 4494+, LDMAR,555,, & MEM.CONT REQB,,,,MWORD & 
AM2°084 & PCU.NOP & CONT 


REQUEST BUS FOR INSTRUCTION N+1. READ INSTRUCTION N 


ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 
DATAPATH 459+,,LDMAR,,5,,5, & MEM.CONT REQB,MREQ,,,MWORD & 
AMZ9@4 & PCU.NEXT & CONT 


REQUEST BUS FOR INSTRUCTION N+2. READ INTRUCTION N+1. 
LOAD 21 REGISTER WITH INSTRUCTION N. 
DECODE FOR INSTRUCTION N 


ALU\YBUS, PASS & DAB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH Z221,,,,,,LDMAR,,ZII,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2964 & PCU.NEXT & JMAP 


REQUEST NEXT INSTRUCTION 


ALU YBUS,PASS & DAB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,,,,,,LDMAR,,ZII,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2904 & PCU.NEXT & JMAP 


SPSS SSH SSS HRS SS Ses SSS SS SHH SSS Se I SSS SSSH== 


RR TYPE INSTRUCTIONS 


LOAD REGISTER 18 RR CC: NONE 
LR R1,R2 = (R2) 


ALU REG,PASSR & AB & CARRYCTL & OEY & WORD & 
DATAPATH ZZ1,,,,,,1LDMAR,,Z11,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2984 & CONTROL & PCU.NEXT & JMAP 

CC: 


ADD REGISTERS 1A RR CSVZ 


AR R1,R2 = (R1) + (R2) 


ALU REG,ADD & AB & CARRYCTL & CEY & WORD & 
DATAPATH Z221,,,,,,LDMAR,,ZII,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2904 ,,EZ,EC,ES,EOVR,CEM, & CONTROL & PCU.NEXT & JMAP 


SUBTRACT REGISTERS RR 
SR R1,R2 R1 = (R1) - (R2) 


CC: CSvZ 


ALU REG,SUBR & AB & CARRYCTL COEQ1 & OEY & WORD & 
DATAPATH ZZ1,,,,,,LDMAR,,ZI11,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2904 ,,EZ,EC,ES,EOVR,CEM, & CONTROL & PCU.NEXT & JMAP 


AND REGISTERS 14 RR CC: CSVZ 
NR R1,R2 R1 = (R1) AND (R2) 


ALU REG,AND & AB & CARRYCTL & OEY & WORD & 
DATAPATH 221,,,,,,LDMAR,,ZII,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2984 ,,EZ,EC,ES,EOVR,CEM, & CONTROL & PCU.NEXT & JMAP 


OR REGISTERS 16 RR 
OR R1,R2 R1 = (R1) OR (R2) 


CC: CSVZ 


ALU REG,OR & AB & CARRYCTL & OFY & WORD & 
DATAPATH ZZI,,,,,,1DMAR,,211,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2964 ,,E2,EC,ES,EOVR,CEM, & CONTROL & PCU.NEXT & JMAP 


COMPARE LOGICAL REGISTERS 15 RR 
CLR R1,R2 CC = RESULT OF (R1) - (R2) 
CONTENTS OF Ri AND R2 ARE NOT AFFECTED 


CC: CSV2 


ALU YBUS,SUBR & AB & CARRYCTL COEQ1 & OEY & WORD & 
DATAPATH 221,,,,,,LDMAR,,Z11,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2984 ,,B2,EC,ES,EOVR,CEM, & CONTROL & PCU.NEXT & JMAP 
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EXCLUSIVE OR ee RR CC: 
XOR R1,R2 (R1) XOR (R2) 


ALU REG,EXOR & AB & CARRYCTL & OEY & WORD & 
DATAPATH 22Z1,,,,,,LDMAR,,2I11,,,, & MEM.CONT REQB,MREQ,,,MWOKD & 
AM2S@4 ,,E2,EC,ES,EOVR,CEM, & CONTROL & PCU.NEXT & JMAP 


CSVZ 


RX TYPE INSTRUCTIONS 


LOAD 58 RX CC: NONE 
LD R1,X2(D) = {(xX2) + D] 


ALU YBUS,ADD & DAB & CARRYCTL & OLY & WORD & CONTROL & RTB & 
DATAPATH ,,,,,YMAR,LDMAR,,,ENZO,,, & MEM.CONT REQE,MREQ,,,MWORD & 
AMZS@4 & PCU.NOP & JSB FETCHOP 


ALU REG,PASSR & DAB & CARRYCTL & OFY & WORD & CONTROL & 
DATAPATH ,,,,,,LDMAR, ,ZI1,ENZ0,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2S04 & PCULNFXT & JMAP 

STORE : 50 RX cc: 
ST R1,X2(D) (x2) + D = (R11) 


NONE 


ALU YBUS,ADD & DAB & CARRYCTL & ORY & WORD & CONTROL & RTB & 
DATAPATH ,.,,5,YMAR,LDMAR,,,ENZ@,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2S04 & PCU.NOP & CONT 


ALU YBUS,PASS & AB & CAFRYCTL & OFY & WORD & CONTROL & 
DATAPATH ZZI,¢55e9,LDDy455,5 & MEM.CONT REQB,,,,MWORD & 
AM2934 & PCU.NOP & CONT 


ALU YBUS, PASS & AB & CARRYCTL & OFY & WORD & CONTROL & 
DATAFATH 4445,,LDMAR,,9.,, & MEM.CONT REQB,MREQ,,WRITE,MWORD & 
AM2904 & PCU.NEXT & JMP RNI 

ADD SA RX CC: CSVZ 
ADD R1,X2(D) Ri = (R1) + [(x2) + DJ 


ALU YBUS,ADD & DAB & CARRYCTL & OLY & WORD & CONTROL & RTB & 
DATAPATH 4,5,,,YMAR,LDMAR,,,ENZ@,,, & MEM.CONT REQS,MREQ,,,MWORD & 
AM2984 & PCU.NOP & JSB FETCFOP 


ALU REG,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,,,,,,LDMAR,,ZII,ENZ0,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2964 ,,8Z,EC,ES,E0VR,CEM, & PCU.NEXT & JMAP 
SUBTRACT 5B RX cC: CSVZ 
SUB R1,X2(D) Ri = (R1) ~ ((X2) + DJ 


ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & RTBS& 
DATAPATH ,,,,,YMAR,LDMAR,,,ENZ@,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2904 & PCU.NOP & JSB FETCHOP 


ALU REG,SUBR & DAB & CARRYCTL COEQ1 & OEY & WORD & CONTROL & 
DATAPATE 455+5,LDMAR,,ZII,ENZO,,, & MEM.CONT REQR,MREQ,,,MWORD & 
AM2904 ,,E2,EC,ES,EOVR,CEM, & PCU.NEXT & JMAP 
AND 54 RX CC: CSVZ2 
N R1,X2(D) = (R1) AND {(x2) + DJ 


ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & RTB SG 


DATAPATH ,5,55,YMAR,LDMAR,,,ENZ@,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2984 & PCU.NOP & JSB FETCHOP 


ALU KEG,AND & DAB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,,,,,,LDMAR, ,ZI1,ENZ6,,, & MEM.CONT REQB, EC wens & 
AM29@4 ,,BZ,EC,ES,EOVR,CEM, & PCU.NEXT & JMAP 
OR 56 RX CC: CSv¥zZ 
O R1,X2(D) Ri = (R1) OR [(X2) + D] 


ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & RTBS& 
DATAPATH ,,,,,YMAR,LDMAR,,,ENZ®,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AMZ9@4 & PCU.NOP & JSB FETCHOP 


ALJ REG,OR & DAB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,55++,LDMAR,,ZII,ENZ®,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AMZ964 ,,EZ,EC,ES,EOVR,CEM, & PCU.NEXT & JMAP 
COMPARE 55 RX CC: CSVZ 
CMP R1,X2(D) CC = RESULT OF (R1) - [(X2) + D] 

THE CONTENTS OF R1 AND (X2) + D ARE NOT AFFECTED 


ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & RTB & 
DATAPATH ,,,5+,YMAR,LDMAR,,,ENZ8,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2984 & PCU.NOP & JSB FETCHOP 





Figure 31. Microprogram for 16-Bit Computer. 








AMDOS/29 AMDASM MICRO ASSEMBLER, Vi. ; BRANCH REGISTER ALWAYS 
MICROPROGRAM FOR 16 BIT COMPUTER ; BRA Ri Pc = (R1) 
; 
; OO2A BR: ALU YBUS,PASS & DAB & CARRYCTL & OEY & WORD & CONTROL & 
@G1A ALU YBUS,SUBR & DAB & CARRYCTL COEQ1 & OFY & WORD & CONTROL & / DATAPATH ,,LDTREG,,,YMAR,LDMAR,,,5, & MEM.CONT REQB,,,,MWORD & 
/ DATAPATH ,4+,,5,LDMAR,,Z11,ENZ0,,, & MEM.CONT REQB,MREQ,,,MWORD & / AM2904 & PCU.NOP & JMP BX1 
/ AM2984 ,,EZ,EC,ES,EOVR,CEM, & PCU.NEXT & JMAP 
; $F AH 0 90 48 8 1 98 DB Hb HD 4 98 38 98 at HOt HD 8 Hh a0 Ht 0 0 8 HH a ry maser rcesasse ees Sse Sess SS Se2Sefesresess=ree 
; ; 
; SUBROUTINE TO FETCH OPERAND FROM MEMORY 7 SHIFT INSTRUCTIONS 
Hy ; 
; PIVITITITITITITTTITT TLL ; Sees cesses ssssscsssssssrssessssssseesse 
; ; 
6613 FETCHCP:ALU YBUS,PASSR & DAB & CARRYCTL & OEY & WORD & CONTROL & ; SHIFT LEFT ARITHMETIC 88 
/ DATAPATH 221,,,,,,LDMAR,,,,,, & MEM.CONT REQB,MREQ,,,MWORD & ; SLA R1, CNT R1 = SHIFT (Ri) ARITHMETIC LEFT CNT PLACES 
/ AM2904 & PCU.NEXT & RIN ; 
; @62B SLA: ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 
, BSS SESS SSeSSsseSesereseseSee se seSseaeSssorezesosesesese= / DATAPATH ZZ1 Re SHTCNTEN, & MEM.CONT ee »MWORD & 
i / AM2964 & PCU.NOP & LDCT 
; IMMEDIATE INSTRUCTIONS ; 
4 @@2C ALU AUR,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & CNTLB HAFO & 
i AM2904 SHIFTEN,,EZ,EC,ES,EOVR,CEM, & PCU.NOP & RPCT 
; LOAD IMMEDIATE 41 RSI CC: NONE Bee ae eet 
; 62D ’ ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL : CONTROL & . 
i DATAPATH LDMAR, , 21 & MEM.CONT REQB,MRE MWORD 
@81C LI: ALU REG,PASSR & DAB & CARRYCTL & OEY & WORD & CONTROL & / AM29@4 & PCULNEXT & JMAP aNBEC yy 
/ DATAPATH ,,,,,,LDMAR,,,EN26,,, & MEM.CONT REQB,MREQ,,,MWORD & ; 
/ AM2984 & PCU.NEXT & JMP START2 : SHIFT LEFT LOGICAL 
; SLL Ri, CNT Ri = SHIFT (R1) LEFT LOGICAL CNT PLACES 
AND IMMEDIATE 94 RSI CC: CSVZ ; 
= Ri AND DI @82E SLL: ALU YBUS,PASS & AB & WCRD & OFY & CARRYCTL & CONTROL & 
/ DATAPATH Z2ZI,0¥0990909eSHTCNTEN, & MEM.CONT 
O@iD NI: ALU REG,AND & DAB & CARRYCTL & OEY & WORD & CONTROL & / AM29@4 & PCU.NOP & LDCT 
DATAPATH ,,,,,,LDMAR,,,ENZ@,,, & MEM.CONT REQB,MREQ,,,MWORD & : 
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Load 
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All: 


ALR 


soe SA th we ee we we NA tee SA bed we ee wee ee 


OR IMMEDIATE 


ALU REG,OR & DAB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,,,,,,LDMAR,,,ENZ@,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2904 ,,EZ,EC,ES,BOVR,CEM, & PCU.NEXT & JMP START2 


EXCLUSIVE OR IMMEDIATE 


ALU REG,EXOR & DAB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,,,5,5, LDMAR,,,ENZ6,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2964 ,,EZ,EC,ES,EOVR,CEM, & PCU.NEXT & JMP START2 


ADD IMMEDIATE 
R1 = R1 + DI 


SUBTRACT IMMEDIATE 
Ri = R1 - DI 


ALU REG,SUBR & DAB & CARRYCTL COEQ1 & OEY & WORD & CONTROL & 0034 
DATAPATH ,,,,,, LDMAR,,,ENZ@,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2904 ,,EZ,EC,ES,EOVR,CEM, & PCU.NEXT & JMP START2 


COMPARE IMMEDIATE 
CC = RESULT OF R1 - DI 
THE CONTENT OF R1 IS NOT AFFECTED 


FEEEEEEEE EP EE EEE EHEEE EE EEE H EHH ee tee 


BRANCH INSTRUCTIONS 


PEELE EEFE EE EAP ESTHET EAE HEHE tt ttt tet 


BRANCH UNCONDITIONAL 
PC = (X2) + D 


ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & RTB & 
DATAPATH , ENTREG,,,+,LDMARy 9599) & 

MEM.CONT REQB,MREQ,,,MWORD & AMZ2984 & PCU.TR2 & JMP START2 8839 ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 
DATAPATH ,,,,,,LDMAR,,ZII,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
ALU YBUS,PASS & AB & WORD & OEY & CONTROL & 

DATAPATH ,ENTREG,,,,,LDMAR,,,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2904 & PCU.TR2 & JMP START2 


BRANCH ON CONDITION 4? 
1, PC = (x2) + D OSA 
(PC) + 2 


ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & RTB & 
DATAPATH ,,LDTREG,,,YMAR,LDMAR,,,ENZO,,,BRIEN & TEST 57 & OO3B ALU LUR,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & CNTLB H#FA & 
DATAPATH & MEM.CONT ,,,,MWORD & 


MEM.CONT REQB,MREQ,,,MWCRD & 
AM2994 SHIFTEN,,EZ,EC,ES,EOVR,CEM, & PCU.NOP & RPCT $ 


AMZ2G@4 ,OECT,,,,,, & PCU.NOP & CJP BX1 
BRANCH NOT NEEDED 


ALU YBUS,PASS & AB & WORD & OEY & CONTROL & 
TATAPATH 221,,5,5,5,LDMAR,,+,,, & MEM.CONT REQB,,,,MWORD & 
AM2904 & PCU.NEXT & JMP RNI 


BRANCH AND LINK 45 

BAL R1,X2(D) Ri = PC + 2, PC = [(X2) + D] 893D 
ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & RTB & 
DATAPATH ,,LDTREG,,,YMAR,LDMAR,,,ENZ@,,, & MEM.CONT ,,,,MWORD & 
AM2904 & PCU.NOP & CONT 


ALU REG,PASS & DAB & CARRYCTL & WORD & CONTROL & 
DATAPATH 9552sPCUYse5951, & MEM.CONT REQB,,,,MWORD & 
AM2G04 & PCU.NOP & JMP BX1 


BRANCH AND LINK REGISTER 
Ri = (PC), PC 


ALU YBUS,PASSR & DAB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,, LDTREG,,,YMAR,LDMAR,,,ENZ@,,, & MEM-CONT +4,5,,MWORD & 
AM2964 & PCU.NOP & JMP BALL 


AM2904 ,,E2,EC,ES,BOVR,CEM, & PCU.NEXT & JMP START2 002F ALU LUR,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & CNTLB HAFO@ & 
DATAPATH & MEM.CONT ,,,,MWORD & 
AM2984 SHIFTEN, ,EZ,EC,ES,EOVR,CEM, & PCU.NOP & RPCT $ 


96 RSI CC: CSVZ 


NN 


AM2904 & PCU.NEXT & JMAP 


SHIFT RIGHT ARITHMETIC 
97 RST CC: CSVZ 


8931 
AM2964 & PCU.NOP & LDCT 


weN NW we we we ee SO 


S¢32 ALU ADR,PASS & AB & WORD & OFY & CARRYCTL & CONTROL & CNTLB H#ES & 


DATAPATH & MEM.CONT ,,,,MWORD & 
AM2964 SHIFTEN,,5Z,EC,ES,EOVR,CEM, & PCU.NOP & RPCT $ 


QA RSI CC: CSVZ 


-—~N 


ALU REG,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & 832 ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 
DATAPATH 4,55,, LDMAR,,,ENZ@,,, & MEM.CONT REQB,MREQ,,»MWORD & 
AM2904 ,,EZ,EC,ES,EOVR,CEM, & PCU.NEXT & JMP START2 


AM2984 & PCU.NEXT & JMAP 


9B RSI CC: CSV2 SHIFT RIGHT LOGICAL 


AM2964 & PCU.NOP & LDCT 


MeN NN Uwe we we ee SON 


95 RSI CC: CS¥Z 8635 ALU LDR,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & CNTLB HAE® & 
DATAPATH & MEM.CONT ,,,,MWORD & 
AM2904 SHIFTEN, ,BZ,EC,ES,EOVR,CEM, & PCU.NOP & RPCT $ 


ALU YBUS,SUBR & DAB & CARRYCTL COEQ1 & OEY & WORD & CONTROL & 6636 ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 
DATAPATH 55555 LDMAR,,,EN20,,, & MEM.CONT REQB,MREQ,,»MWORD & i 
AM2904 ,,EZ,EC,ES,EOVR,CEM, & PCU.NEXT & JMP START2 


NS 


/ DATAPATH ,5,,,,LDMAR,,Z11,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
/ AM2964 & PCU.NEXT & JMAP 
Y 
; ROTATE RIGHT A8 
; RR R1,CNT Ri = ROTATE (R1) RIGHT CNT PLACES 
’ 

0637 RRL: ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 
/ DATAPATH ZZIy55500999e9SHTCNTEN, & MEM.CONT 
/ AM2984 & PCU.NOP & LDCT 

74 RX CC: NONE 5 

2838 ALU LDR,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & CNTLB H#FA & 
/ DATAPATH & MEM.CONT ,,,,MWORD & 
/ AM2904 SHIFTEN,,EZ,EC,ES,EOVR,CEM, & PCU.NOP & RPCT $ 
9 


AM29@4 & PCU.NEXT & JMAP 
ROTATE LEFT 
RX CC: NONE 


AM2964 & PCU.NOP & LDCT 


we NNO we we we ee NL 


NN 


8B3C ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 
DATAPATH ,5,555,LDMAR,,Z11,,,, & MEM.CONT REQB,MREQ,,..MWORD & 


AM2964 & PCU.NEXT & JMAP 
ROTATE RIGHT THROUGH CARRY 


RRC R1,CNT 
RX CC: NONE 


AM29@4 & PCU.NOP & LDCT 


wee NN we we we 8 NO, 
f=] 
oO 
o 


O83E ALU LDR,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & CNTLB H#EO & 
DATAPATH & MEM.CONT ,,,,MWORD & 
AM2S04 SHIFTEN,,EZ,EC,ES,EOVR,CEM, & PCU.NOP & RPCT $ 


OO3F ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL §& 
DATAPATH ,,5,5,,LDMAR,,Z1II,,,, & MEM.CONT REQB,MREQ,,,MWORD & 


“NN 


/ 
re ) RR CC: NONE / AM2984 & PCU.NEXT & JMAP 
= (R2 ; 
; ROTATE LEFT TERCUGH CARRY 
’ 
; 
0648 R 
/ DATAPATH ZZ1,555202999eSHICNTEN, & MEM.CONT 


Figure 31. Microprogram for 16-Bit Computer (Cont.) 


»ereMWORD & 


0938 ALU YBUS,PASS & AB & WORD & OFY & CARRYCTL & CONTROL & 
DATAPATH ,5005,LDMAR,,ZI1,,,, & MEM.CONT REQB,MREQ,,,MWORD & 


SRA R1,CNT R1 = SHIFT (R1) RIGHT ARITHMETIC CNT PLACES 


ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL § 


DATAPATH 2Z1,seee0005e5HTCNTEN, & MEM.CONT ,,,,MWORD & 


DATAPATH ,5555eLDMAR,,ZII,,,, & MEM.CONT REQB,MREQ,,,MWORD & 


SRL R1,CNT R1 = SHIFT (R1) RIGHT LOGICAL CNT PLACES 


RL: ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 


DATAPATH ZZlysee00eer9,SHTCNTEN, & MEM.CONT ,,,,MWORD & 


yee eMWORD & 


AA RST 
RL R1,CNT Ri = ROTATE (R1) LEFT CN 


LL: ALU YBUS,PASS & AB & WORD & OFY & CARRYCTL & CONTROL & 


DATAPATH ZZ1,5559500eeeSHTCNTEN, & MEM.CONT ,,,,MWORD & 


ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL §& 
DATAPATH Z2Z1,555¢520sse9HTCNTEN, & MEM.CONT ,,,,MWORD & 


RLC R1,CNT ROTATE (R1) CNT TIME LEFT THROUGH CARRY 


LC: ALU YBUS,PASS & AB & WORD & OBEY & CARRYCTL & CONTROL & 
ave eMWORD & 


343 


344 


AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
MICROPROGRAM FOR 16 BIT COMPUTER 
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AM2964 & PCU.NOP & LDCT 


ALU LUR, PASS & AB & WORD & OEY & CARRYCTL & CONTROL & CNTLB H#FO & 


DATAPATH & MEM.CONT ,,,,MWORD & 
AM2904 SHIFTEN, ,EZ,EC,ES,EOVR,CEM4, & PCU.NOP & RPCT $ 


ALU YBUS,PASS & AB & WORD & OEY & CARRYCTL & CONTROL & 


DATAPATH y+95),LDMAR,,ZI1I,,,- & MEM.CONT REQB,MREQ,,,MWORD & 
AM2964 & PCU.NEXT & JMAP 


1/0 INSTRUCTIONS 


INPUT As RX CC: NONE 


IN R1,X2(D) R1 = PORT (X2) + D 
ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & RTB & 


DATAPATH ,,,5>»,YMAR, LDMAR,,,ENZ®6,,, & MEM.CONT REQE,,HREQ, »MWORD & 


AM2964 & PCU.NOP & LDCT H#9@1 


ALU & WORD & OEY & CONTROL ,IOEN & CNTLB H#FD & 7 
DATAPATH & MEM.CONT ,,HREQ, ,MWORD & 
AM29@4 & PCU.NOP & RPCT $ 


ALU & WORD & CEY & CONTROL ,IOEN,, & CNTLB H#FF & 
DATAPATH & MEM.CONT ,,HREQ,,MWORD & 
AM2984 & PCU.NOP & CONT 


ALU REG,PASSR & DAB & WORD & OEY & CARRYCTL & CONTROL & 
DATAPATH ,,,,,,LDMAR,,,ENZ2®,,, & MEM.CONT REQB,,,,MWORD & 
AM2984 & PCU.NOP & JMP START1 


OUTPUT RX CC: 
OUT R1,X2(D) (R1) 


ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL S RTB & 


A2 NONE 
PORT (X2) + D = 


DATAPATH ,,,,,Y¥MAR,LDMAR,,,ENZO,,, & MEM.CONT REQB,,HREQ,,MWORD & 


AM2964 & PCU.NOP & CONT 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & 
CONTROL ,IOEN,, & CNTLB H#FB & 

DATAPATH yy9¢¢9sLDDyy99» & MEM.CONT ,,HREQ, ,MWORD & 
AM2904 & PCU.NOP & LDCT H¥661 


ALU & WORD & CONTROL & OEY & 


DATAPATH & MEM.CONT ,,HREQ,,MWORD & 
AM2964 & PCU.NOP & RPCT $ 


ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF & OEY & 
DATAPATH & MEM.CONT ,,HREQ,,MWORD & 
AM2904 & PCU.NOP & JMP START 


STACK OPERATIONS 


PUSH REGISTERS ce RR CC: NONE 
PUSH R1,RN (SP - 2) = R1 
(SP ~ 4) = 


=R 
(SP - 2*N) = RN 
SP = SP - 2*N 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH Z2ZI ,55000000905 & MEM.CONT REQB,MREQ,,,MWORD & 
AM2904 & PCU.NOP & CONT 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH 455+5,4DMAR,LDD,,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2G04 & PCU.PUSH & CONT 


ALU YBUS,PASS & AB & CARRYCTL & OEKY & WORD & CONTROL § 


DATAPATH ,,,ENCTR,INC, 559957, & MEM.CONT REQB,MREQ,,WRITE,MWORD & 


TEST Q#78 & AM2204 & PCU.NOP & CJP PUSH+1 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH 45595,LDMAR, 4555, & MEM.CONT REQB,MREQ,,,MWORD & 
AMZ29@4 & PCU.NOP & JMP RNI 


POP REGISTERS C1 RR CC: NONE 
POP R2,Ri R2 = (SP) 

Ri = (SP + 2) 

RN = (SP + 2*N) 

SP = SP + 2*N 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ZZ ,,,550009995 & MEM.CONT REQB,MREQ,,,MWORD & 
AM2984 & PCU.NOP & CONT 


ALU REG, PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,5+,+,,LDMAR,,,EN20,,, & MEM.CONT REQB,,,,MWORD & 
AM2984 & PCU.SP & CONT 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL §& 
DATAPATF ,,,ENCTR,yo¢50799 & MEM.CONT REQB,MREQ,,,MWORD & 
TEST Q#76 & AM29@4 & PCU.POP & CUP POP+1 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH 45999 yLDMARyo994, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2984 & PCU.NOP & JMP RNI 


SUBROUTINE CALL c2 RX 
CALL X1(D) SP = SP - 2 

(SP - 2) = (PC) 

Pc = [(X1) + D) 


ALU YBUS,ADD & DAB & CARRYCTL & OEY & WORD & CONTROL & RTB & 
DATAPATE ,,LDTREG,,,,LDMAR,,,ENZ8,,, & 
MEM.CONT REQB,MREQ,,,MWORD & AM2984 & PCU.PUSH & CONT 


CC: NONE 


ALU YBUS,PASS & AB & CARRYCTL & WORD & CONTROL & 
DATAPATH 455+ PCUY,,LDD,.,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2984 & PCU.NOP & CONT 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,ENTREG,,,,,LDMAR,y5595 & 
MEM.CONT REQB,MREQ,,WRITE,MWORD & AM29084 & PCU.JUMP & JMP START1 
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RETURN FROM SUBROUTINE c3 CC: NONE 
RET Pc = (SP) 
SP = SP + 2 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH yo5¢e2LDMAR,,059, & MEM.CONT REQB,,,,MWORD & 
AM2984 & PCU.SP & CONT 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,,45,,LDMAR,,,,+, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2964 & PCU.SP & CONT 


ALU YBUS,PASSR & DAB & CARRYCTL & OEY S WORD & CONTROL & 
DATAPATH ,,LDTREG,,,YMAR,LDMAR,,,ENZO,,, & 
MEM.CONT REQB,MREQ,,,MWORD & AM29604 & PCU.POP & CONT 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,ENTREG yo550209972 & MEM.CONT ,,,,MWORD & 
AM2904 & PCU.JUMP & JMP START1 


GS 


INTERRUPT INSTRUCTIONS 


Sees rer sso Ses SSeS Ss esses sSSsltes SSS 


LOAD INTERRUPT MASK CA RI CC: NONE 
LIM DI LOAD LOWER BYTE OF DI INTO MASK REGISTER 


ALU YBUS,PASSR & DAB & CARRYCTL & OEY & WORD & 

CONTROL ,,,INTRIEN & CNTLB H#FE & 

DATAPATH ,5+5,+LDMAR,,,ENZO,,, & MEM.CONT REQB ,MREQ,,,MWORD & 
AM2964 & PCU.NEXT & JMP START2 

ENABLE INTERRUPT cs cTL CC: NONE 
EI ENABLE INTERRUPT SYSTEM 


ALU YBUS,PASS & AB & CARRYCTL & OFY & WORD & 

CONTROL ,,,INTRIEN & CNTLB H#FF & 

DATAPATH ZZ1,5.5—,LDMAR,,Z11,,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM29@4 & PCU.NEXT & JMAP 

DISPABLE INTERRUPT cS CTL CC :NONE 
DI DISABLE INTERRUPT SYSTEM 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & 

CONTROL ,,,INTRIEN & CNTLB H#FD & 

DATAPATH 2Z1,,4,5.,,LDMAR,,Z11,,,, & MEM.CONT REQB,MREQ,, »MWORD & 
AM2904 & PCU.NEXT & JMAP 


RETURN FROM INTERRUPT cB CTL Cc: (Sp+2) 
RTI PC = (SP) ,PSW = (SP+2) 
SP = SP + 4, INTERRUPT ENABLED 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL 6 
DATAPATH ,555,eLDMAR,,,,,5, & MEM.CONT REQB,,,,MWORD & 
AM2964 & PCU.SP & CONT 


ALU YBUS,PASSR & DAB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,,LDTREG,,,,,,,2NZ20,,, & MEM.CONT REQB,MREQ,,,MWORD & 
AM2964 & PCU.POP & CONT 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & CONTROL & 
DATAPATH ,ENTREG we eyr009909 & MEM.CONT ,,,,MWORD & 
AM2904 & PCU.JUMP & CONT 


ALU YBUS,PASS & AB & CARRYCTL & OEY & WORD & 

CONTROL ,,,INTRIEN & CNTLB HA#F9 6 

DATAPATH ,5450+ LDMARss,5e, & MEM.CONT REQB,,,,MWORD & 
AM2964 & PCU.SP & CONT 


ALU YBUS,PASSR & DAB & CARRYCTL & OEY & WORD & 
CONTROL ,,,INTRIEN & CNTLB H#FF & TEST Q#86 & 
DATAPATH ,+0790599EN2Z0,,, & MEM.CONT REQB,MREQ, 


» MWORD & 
AM2904 ,,EZ,EC,ES,EOVR,CEM, & PCU.POP & JMP START 


SHEETS EEE EEF EEEEHE EEE EEH HEHE tH HH te 


INITIALIZATION ROUTINES 


FEEEEEEEEETEEFEEFE EEE EE EEE ESTEE EEE HH 
ORG H#18@ 


ALU REG,PASSR & DAB & WORD & OEY & CARRYCTL & 
DATAPATH & MEM.CONT ,,HKEQ,,MWORD & CONTROL ROM,,, & CNTLB 98 & 
IMMD H#8090 & CONT 


INITIALIZE REGISTERS IN AM29614 
RO = @, R1 = 4000H, R4 = 2, AND RS = 4 


ALU & WORD & CONTROL ,,,INTRIEN & CNTLB HAFO & 
DATAPATH & MEM.CONT ,,HREQ,,MWORD & 
AM2984 & PCU ,,PCUDZ,AS,36 & IMMD H#O@68@ & CONT 


ALU & WORD & CONTROL ,,,INTRIEN & CNTLB HaFs & 
DATAPATH & MEM.CONT ,,HREQ,,MWORD & 
AM2984 & PCU ,,PCUDZ,A1,B31 & IMMD H#¥408@ & CONT 


ALU & WORD & PCU ,,PCUDZ,A4,B4 & CARRYCTL & DATAPATH 6 
MEM.CONT ,,HREQ,,MWORD & IMMD H¥6@02 & CONT 


ALU & WORD & PCU ,,PCUDZ,A5,B5 & CARRYCTL & DATAPATH & 
MEM.CONT ,,HRFQ,,MWORD & IMMD H#6@04 & CONT 


ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF & DATAPATH & 
MEM.CONT ,,HREQ,,MWORD & AM2904 & PCU.NOP & CONT 


INITIALIZE CONSOLE AM9551 


ALU REG,PASSR & DAB & WORD & OEY & CARRYCTL & 
DATAPATH & MEM.CONT & CONTROL ROM,,, & CNTLB 18 & 
IMMD H#FFFB & CONT 


ALU REG,PASSR & DAB & WORD & OEY & CARRYCTL & 
DATAPATH & MEM.CONT & CONTROL ROM,,, & CNTLB 20 & 
IMMD H#OBCE & CONT 


ALU & WORD & CONTROL & DATAPATH & MEM.CONT & JSB I0W 
ALU REG,PASSR & DAB & WORD & OEY & CARRYCTL & 


DATAPATH & MEM.CONT & COKTROL ROM,,, & CNTLB 2@ & 
IMMD H#0®835 & CONT 





Figure 31. Microprogram for 16-Bit Computer (Cont.) 








SOT eh SPSS 


AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
MICROPROGRAM FOR 16 BIT COMPUTER 


@18A ALU & WORD & CONTROL & DATAPATH & MEM.CONT & JSB I0W 
5 
@18B ALU & PCU.NOP & DATAPATH & MEM.CONT & JMP START 
4 HH HH HD HF AEE HEE HEE HE HED HEHE HE 
I/O WRITE SUBROUTINE 
THE ADDRESS OF I/O PORT IS IN R1 
THE DATA TO BE WRITTEN IS IN R2 
9698 38 HE MH HE HO HEH HA SE HE HEH HE HE HE HH HE HEH 
OW: ALU YBUS,PASS & AB & oe & OEY & WORD & 
CONTROL ROM, ,, & CNTLB 
DATAPATH ,,,.,,YMAR, toys 23 & MEM.CONT REQB,,HREQ,,MWORD & 
AM2964 & PCU.NOP & CONT 
18D ALU YBUS,PASS & AB & methadg & OEY & WORD & 
CONTROL ROM,» & CNTLB 20 
DATAPATH ,505+,,LDD,,,,4, & SEM. CONT REQB, ,HREQ,,MWORD & 
AM2904 & PCU.NOP & CONT 


0185 ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FB & DATAPATH & 


@18c 


Oo NN id we et we we we we we we 


we QR 


/ MEM.CONT REQB,,HREQ,,MWORD & AM29@4 & PCU.NOP & LDCT H#@91 
; 
O18F ALU o WORD & CONTROL & DATAPATH & 
/ MEM.CONT ,,HREQ,,MWORD & AM2904 & PCU.NOP & RPCT $ 
; 
6198 ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF & DATAPATH & 
/ MEM.CONT REQB,,HREQ,,MWORD & AMZ924 & PCU.NOP & RIN 
} FE EEEEEEEE EEF EEE EEE EEE EEE tee ett tet 
; 
i VECTOR JUMP ENTRY POINTS 
s 
3 SHEER EEE HEHE teh $4 444444444 
; 
@1D8 ORG H#1D@ 
; 
H INTERRUPT @, PC = 19H 
3 
@1D@ INT@: ALU & WORD & CONTROL ,IOEN,INTDIS, & CNTLB B#F7 & 
/ DATAPATH & MEM.CONT ,,,,MWORD & 
/ AM29@4 & PCU ,,PCUDZ,A8,B@ & IMMD E#6@10 & CONT 
? 
@1D1 ALU & WORD & CONTROL ,,INTDIS,INTRIEN & CNTLB H#FD & 
/ DATAPATH & MEM.CONT & AM29@4 & PCU.NOP & CONT 
, 
#1D2 ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF 6 
/ DATAPATH ,,+,,,LDMAR,4,,,, & MEM.CONT REQB,,,,MWORD & 
/ AM29604 & PCU.NOP & JMP STARTI 
; 
; INTERRUPT 1, PC = 148 
’ 
@1D3 INT1: ALU & WORD & CONTROL ,,INTDIS, mare & CNTLB H#FD & 
/ DATAPATH & MEM.CONT ,,,,MWORD 
/ AM2994 & PCU ,,PCUDZ,AB,3B® & IMMD H¥O614 & CONT 
’ 


6104 ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF & 
DATAPATH ,,,,5,,LDMAR,,,,,, & MEM.CONT REQB,,,,MWORD & 
AM2964 & PCU.NOP & JMP START1 


INTERRUPT 2, PC = 18H 
ALU & WORD & CONTROL ,,INTDIS,INTRIEN & CNTLB H#FD & 


DATAPATH & MEM.CONT ,,,,MWORD & 
AM2964 & PCU ,,PCUDZ,A6,BO & IMMD H#¥@@18 & CONT 


0105 


ee NK Hae ee ee KOK 
z 
La | 
n 


@1D6 ALU & WORD & Deron »lOEN,, & CNTLB HAFF & 
DATAPATH ,,,.4,,LDMAR,,,,,, & MEM.CONT REQB,,,,MWORD & 
AM2904 & PCU.NOP & JMP START1 


INTERRUPT 3, PC = 1CH 
ALU & WORD & CONTROL ,, INTDIS,INTRIEN & CNTLB H#FD & 


DATAPATH & MEM.CONT ,,,,MWORD & 
AM2964 & PCU ,,PCUDZ,A6,B@ & ThMD H#861C & CONT 


81D7 


we NN 4 woe ee ee SO 
z 
xd 
a 


@1D& ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF & 
DATAPATH 455++eLDMAR,,,,,, & MEM.CONT REQB,,,,MWORD & 
AM2984 & PCU.NOP & JMP START1 


INTERRUPT 4, PC = 26H 
ALU & WORD & CONTROL ,,INTDIS, peeeiee & CNTLB H#FD & 


DATAPATH & MEM.CONT ,,,,MWORD & 
AM29604 & PCU ,,PCUDZ,A@,36 & IMMD H¥80628 & CONT 


@1D9 


wee NON et we ee ee NON 
= 
3 
os 


@1DA ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF & 


DATAPATH 4595+ LDMARyy555» & MEM.CONT REQB,,,,MWORD & 
AM2964 & PCU.NOP & JMP STARTI 


INTERRUPT 5, PC = 248 


ALU & WORD & CONTROL ,,INTDIS, pee es & CNTLB HAFD & 
DATAPATH & MEM.CONT ,,,,MWORD 
AM2964 & PCU ,,PCUDZ,AS,B® & IMD H#¥@€24 & CONT 


@1DC ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF & 
DATAPATH 45559,LDMAR+,55,,5 & MEM.CONT REQB,,,,MWORD & 
AM2964 & PCU.NOP & JMP STARTI 


@1DE 


MeN Nt ee we wee NON 
= 
3 
oO 


INTERRUPT 6, PC = 288 


we NN hte ee we 
=z 
3 
fe.) 


@1DD ALU & WORD & CONTROL ,,INTDIS,INTRIEN & CNTLB H#FD & 
DATAPATH & MEM.CONT ,,,,MWORD & 
AM2904 & PCU ,,PCUDZ,A9,B® & IMMD H#¥@@28 & CONT 
@1DE ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF & 
/ DATAPATH ,55,,,LDM Mis ces & MEM.CONT REQB,,,,MWORD & 
/ AM2964 & PCU.NOP & JMP START 
, 
; INTERRUPT 7, PC = 2CE 
, 
O1iDF INT? ALU & WORD & CONTROL ,,INTDIS, eee & CNILB HAFD & 
/ DATAPATH & MEM.CONT ,,,,MWORD 
/ AM2964 & PCU ,,PCUDZ,A@,BO & INMD H#@02C & CONT 
, 
G1ES ALU & WORD & CONTROL ,IOEN,, & CNTLB H#FF & 
/ DATAPATH sei PeLOMARG svi & MEM.CONT REQB,,,,MWORD & 
/ AM2964 & PCU.NOP & JMP STARTI 
; Bese SSS SSS SSeS S SS SSS SSS SSSS5 Ee ESser== 
; 
3 INTERRUPT HANDLER 
, 
; Ser St rSs FSS SSS SSS SSS SS SS SST SESS SE STE 
, 
G1F8 ORG H#1F6 
; 
@1F6 INTR: ALU & WORD & CONTROL & 
/ DATAPATE & MEM.CONT REQB, MREQ,,WRITE,MWORD & 
/ AM29604 & PCU.DEC4 & CONT 
, 
@1F1 ALU & WORD & CONTROL & 
/ DATAPATH ,455+5LDMAR,,,.,, & MEM.CONT ,,,,MWORD & 
/ AM2964 & PCU.PUSH & CONT 
, 
@1F2 ALU & WORD & CONTROL & 
/ DATAPATH ,,,,,PCUY,,LDD,,,,, & MEM.CONT REQB,,,,MWORD & 
/ AM2964 & PCU.NOP & CONT 
; 
B1F3 ALU & WORD & CONTROL ,,,INTRIEN & CNTLB H¥F5 & 
/ DATAPATH & MEM.CONT REQB,MREQ,,WRITE,MWORD & 
/ AM2994 & PCU.NOP & JMPV 
; MH HH Ae HH eae he He ae eH ea IC a a HE I a ea 
; 
; INTERRUPT ENTRY POINT 
: MEME AEE A AR A Be ae a a ea ae A a a Ra a a ee 
; 
@1FF ORG H#1FF 
; 
O1FF ALU & WORD & CONTROL & 
/ DATAPATH ,55¢,,LDMAR,LDD,,,PSW,, & MEM.CONT REQB,MREQ,,,MWORD & 
/ AM2S84 & PCU.PUSH & JMP INTR 
; 


END 





Figure 31. Microprogram for 16-Bit Computer (Cont.) 


MICROCODE TRANSLATION 


It is often convenient for the microprogrammer to assign mi- 
croword fields such that they occupy positions that differ from 
those in the actual hardware implementation. This is often the 
case when the microprogrammer, for convenience, allocates bits 
according to the functions to be performed and then needs to 
translate the object code produced by AMDASM¢® to be consis- 
tent with the hardware microprogram memory design. 


There is another instance where the ability to shift bit assignment 
is important to the engineer. As a given product evolves, bits may 
be added or deleted from the original microword format. When 
this occurs, a mapping function is desired to minimize hardware 
changes. 


The program in SYSTEM/29® that performs such a mapping 
function is called AMSCRM. The AMSCRM maps the output of 
AMDAGSM (logical bit pattern) into the bit pattern that is consistent 
with the 16-bit computer hardware. A table of the logical to physi- 
cal mapping is shown in Table 3. 


ENGINEERING MODEL AND MACROCODE 


With the proper tools — designing, microprogramming, prototyp- 
ing, and checking out a new computer design is not overly dif- 
ficult. The major tools used for the high- epere 16-bit design 
described in this application note was System 29"). System 29 is 
a software driven hardware prototyping system which allows 
microprogramming, hardware design/checkout, and macropro- 
gramming (programming in the language of the target machine) 
to occur simultaneously. At the point where the design is reason- 
ably rigid, and the hardware is mostly fabricated, System 29 
allows the engineer to create “instant” microprograms to check 
out the new computers’ internal data paths. Microprogram 
software support features of System 29 also allow the engineer to 
single cycle, single instruction step, instruction trace, and trap on 
pre-specified events coming true. Simultaneously with this initial 
internal check-out, the microcode for some very simple machine 
instruction should be written (i.e., load register, add register, or 
register, etc.). The next step is to check out the main memory 
paths with load and store instructions. At this point, a-reasonable 
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Table 3. 


BE HE ae He Me He ae He Be ae ae He Ae He ae eae ae ae ae a ae ke he ae ake ee he he a eae 


BIT ASSIGNMENT FOR 16-BIT COMPUTER 


MEME 2 ae Ae ae ae ah 3 he ae a 2 ae ae ae he a ake ake he ae a a ae Me ae I Ne ak a] aI 


BIT POSITION MNEMONIC * DESCRIPTION 
LOG PHY 

95 95 RTB = REG. FIELD 2 TO B PORT OF AM2983 
94 SPARE 

¢3 SPARE 

92 54 221 LOAD 2 REG. INTO ZI REG. 

91 94 CCEN * AM2918 CONDITON CODE ENABLE 


AM2983 ALU CONTROL BITS 


98 93 WORD = WORD MODE = @, BYTE MODE = 1 
89 92 EA * ENABEL A LATCH ON AM2983 
88 91 OEY * ENABLE Y OUTPUT ON AM29083 
87 96 OEB * ENABLE B LATCE ON AM2903 
86-78 89-81 18-18 INSTRUCTION LINES FOR AM2983 


DATAPATH BITS 


77 86 ENTREG * ENABLE TRANSFER REG 

76 79 LDTREG LOAD TRANSFER REG. 

75 78 ENCTR * I-REG ENABLE COUNTER 

74 77 INC I-REG INC=1/DEC=0 

73 76 PcucD PCU TRANSFER CHIP DISABLE 

72 75 PCUY PCU TRANSFER TO Y-BUS 

71 74 LDMAR LOAD MEMORY ADDRESS REGISTER 
76 73 LDD LOAD D-REGISTER 

69 72 zII LOAD ZI INTO I REGISTER 

68 71 ENZO * ENABLE Z@ REGISTER TO DA BUS 
67 70 PSW 7 ENABLE PSW REGISTER TO DA BUS 
66 69 SHTCNEN * SHIFT COUNT AM2918 ADDRESS 
65 68 BRIEN * BRANCH INSTRUCTION ENABLE 


AM2961A PROGRAM CONTROL UNIT 


64 67 PCUI7 F TO B-RAM = 1 (DEFAULT), F TO Q-REG = @ 
63 66 PCUIS ADD = 1 (DEFAULT), SUB = @ 

62-68 65-63 PCUI2-9 PCU SOURCE CONTROL 

59-57 62-60 PCUA2-6 PCU ARAM SELECT 

56-54 59-57  PCUB2-8 PCU B-RAM SELECT 


MEMORY CONTROL 


53 92 REQB BUS REQUEST 

52 51 MREQ MEMORY REQUEST 

51 5@ HREQ * HOLD REQUEST 

5@ 4S WRITE MEMORY READ = @ (DEFAULT), MEMORY WRITE = 1 

49 48 MWORD MEMORY BYTE OP = @ (DEFAULT), MEMORY WORD OP = 1 


CONTROL BIT STROBES 


48 56 IMMD * ENABLE IMMEDIATE FIELD TO DA BUS 

4? 47 ROM I-REG ENABLE = @ (DEFAULT), ROM ENABLE = 1 
46 46 IOEN * I/O CONTROL REGISTER ENABLE 

45 45 INTDIS od AM2914 INTERRUPTS DISABLE 

44 44 INTRIEN * AM2914 INSTRUCTION ENABLE 

45 43 SHFTEN ‘s AM2964 SHIFT ENABLE 


GENERAL PURPOSE CONTROL BITS 
42-35 42-35 CNTLB7-6 BITS TO BE STROBED BY CONTROL STROBES 
AM29@4 STATUS REGISTER CONTROL BITS 


34 34 OECT * OUTPUT ENABLE OF CONDTIONAL TEST 
33 33 EZ = ENABLE ZERO FLAG UPDATE 
32 32 KC * ENABLE CARRY FLAG UPDATE 
31 31 ES = ENABLE SIGN FLAG UPDATE 
38 38 EOVR * ENABLE OVERFLOW FLAG UPDATE 
29 29 CEM * ENABLE MACHINE STATUS REGISTER 
28 28 CEU * ENABLE MICROPROGARM STATUS REGISTER 
2? 27 112 AM2984 112 CARRY OUT CONTROL 
26 26 T11 AM2904 CARRY OUT CONTROL 
TEST BITS 
25-235 25-25 TESTS5-3 AM2984 TEST BITS 
22-28 22-28 TEST2-9 AM2964 & AM25LS251 TEST BITS 


AM2918 SEQUENCE CONTROL 

19-16 19-16 NACS-@ AM2919 NEXT ADDRESS CONTROL 
NEXT MICRO ADDRESS OR IMMEDIATE FIELD 

15-8 15-8 M15-6 SHARED FIELD FOR NEXT ADDRESS OR IMMD 
END 


instruction sub-set should be microprogrammed (a phase 1 in- 
struction set) that will allow a simple monitor to be written in the 
target machines’s language. This monitor should run on the 
target machine and provide commands for: memory display, 
memory store and jump to memory location. The phase 1 instruc- 
tion set and simple monitor now provides the basic foundation for 
completing the full computer design. 


The standard System 29 configuration provides automatically for 
microcode and hardware development. In order to efficiently 
develop and implement the target machine’s software, a target 
machine assembler and a mechanism for loading the machine’s 
main memory must be provided. System 29 uses an AmM9080A 
microprocessor, dual floppy disks, and a full function disk 
operating system to support microprogrammed hardware and 
firmware development. The Am9080A microprocessor can ad- 
dress 64k bytes of memory. The disk operating system uses only 
the first 32k bytes and the remaining 32k is used to memory map 
(page) functions from the hardware development side. Through 
this mechanism, the designer has the ability to directly load and 
manipulate microprograms, monitor hardware functions, etc. 
There are extra enable lines from the page register which allow 
the System 29 user to map other functions into the support 
processor’s upper 32k of memory. 


The main memory of this 16-bit high-speed computer design was 
mapped into the support processors upper 32k via one of the 
unused page register enable lines. Besides the normal 16-bit 
interface, a simple 8-bit interface was added to the main memory 
thus making it a simple two port memory. When the 16-bit com- 
puter is halted (via a System 29 command) location 0 of 16-bit 
main memory would be addressed as location 8000 hex of Sys- 
tem 29 support processor memory. Location 1 would be 8001, 2 
would be 8002, etc. This affected a mechanical link between the 
16-bit prototype design and System 29. 


In order to efficiently write a reasonably complex piece of 
software (such as a simple monitor), an assembler for the target 
instruction set is needed. Since this 16-bit computer design is not 
exactly like any other 16-bit computer, ready to run software tools 
are not available. A macro assembler is available as an optional 
enhancement to the System 29 software base. Even though this 
macro assembler is for programming in AmM9080A assembly lan- 
guage, there is a user installable patch which will disable all of the 
Am9080A operation codes (Figure 32). With this patch installed, 
the user may now write a macro library defining the target 
machine’s instruction set. It is not necessary to code the entire 
instruction set, as the first level of programming for the new 
machine (simple monitor, etc.) will be using only the phase 1 
instruction set. A complete macro library of the AMD high- 
speed 16-bit computer phase 1 instruction set is contained in 
Appendix B. 


Now that the tools are in place, it is relatively simple to code and 
implement a simple monitor for the target machine. Appendix C 
contains the complete simple monitor listing for the AMD high- 
speed 16-bit computer. Only the phase 1 instruction set was used 
which does not include byte instruction, call and return instruc- 
tions, stack instructions, any special instructions, etc. This simple 
monitor understands three commands: Display (D), Store (S), 
and Jump (J). Typing D followed by an address value will display 
256 bytes of main memory beginning on the address given 
(rounded back to the nearest eight word boundary). Typing an S 
followed by an address, followed by data, will store the data 
consecutively, on a nibble basis beginning at the given address. 
Typing in J followed by an address will cause the processor to 
begin execution at the main memory location given by the ad- 
dress. Commands, addresses, and data must be separated by at 
least one delimiter (space, comma, or period). 


The change file shown below can be integrated into MAC to produce a new program, which we will call 
MAC29. The MAC29 program will not recognize 8080 mnemonics, but will recognize all the MAC pseudo 


operators and arithmetic functions. 


=e @6 @O BE 


25 
26 
2561H 


RT EQU 
pT EQU 
TAREA EQU 


c 


24444 
TAREA 


ORG 
JMP 


ORG 
TYPE IS 
CPI 

Jc 

CPI 

RNZ 


TAREA 


RT 
TY PEOK 
PT 


TYPEOK: INX 
MOV B,M 
CMP A 
RET 


END 


MACRO ASSEMBLER 


“MAC" CHANGES TO DISABLE 8089 OPCODES. 


78086 REGISTER NAME 
;PSEUDO OPCODE TYPE 


;FREE AREA IN TOKEN 


;OVERLAY INX H 


MODULE 


MOV B,M RET 


IN THE ACCUMULATOR 


;BELOW RT IF ARITH OP 


;PSEUDO OP? 
;RETURN WITH NON-ZERO FLAG 
; OTHERWISE, PSEUDO OP OR ARITH OP 


H 


;SET ZERO FLAG 





Figure 32. Macro Assembler Disable Opcode Patch. 
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After writing the monitor, and putting it onto floppy disks via the 
System 29 editor, it must be assembled using the modified macro 
assember (described earlier). The result of the assembly is a hex 
file which is suitable for loading into the 16-bit computer’s main 
memory. This hex file is now loaded into support processor mem- 
ory beginning at location 8000 hex. As discussed previously, this 
is mapped at location zero in the 16-bit computer's main memory. 
Assuming the microcode is loaded and a terminal is connected to 
the 16-bit computer, the monitor in 16-bit main memory may now 
be executed. The complete System 29 session from editing and 
assembling the monitor to loading and executing it is given in 
Appendix D. 


SUMMARY 


As can be seen throughout these application notes, designing a 
high performance Bipolar microprocessor system is a straight- 
forward task. The Am2900 Family is ideally suited to provide 
building blocks for the various elements of the computer. These 
include the Computer Control Unit, the Central Processing Unit, 
the Program Control Unit, the Interrupt Structure and the various 
bus controls. Together, these elements allow the designer to 


build computers using the current state-of-the-art architecture 
with LSI building blocks. 


As technology improves, Advanced Micro Devices has been able 
to redesign these building blocks to offer increased performance. 
Thus, the Am2901 has evolved through an Am2901A, then an 
Am2901B and now an Am2901C is in the planning. In addition, the 
Am2903 offers additional architectural advantages and soon an 
Am29103 will provide additional speed and performance fea- 
tures. Similarly, the microprogram sequencer area began with the 
Am2909 and Am2911; then was followed by the larger Am2910. 
Soon, the Am2909A and Am2911A will provide higher speed in 
the microprogram sequencer area and will be followed by an 
Am2910A. 


Thus, the future for Bipolar LS! building blocks includes not only 
more advanced product designs offering higher levels of integra- 
tion and new functions for new architectures, but also offers 
higher performance versions of the already existing products. 
Advanced Micro Devices is committed to providing high perfor- 
mance Bipolar LSI circuits utilizing proven technology designed 
to operate over the full military operating range as well as the 
commercial operating range. As always, these products continue 
to meet the performance requirements of MIL-STD-883. 





APPENDIX A 
Complete Description of Instructions 


LOAD 


RX, RSI 


The second operand is loaded into the general register specified 
by R, ‘ 


STORE 


D 
x< 
pe) 
” 


’ 


The first operand specified by R, is stored at the location 
specified by the second operand. 


D 
x< 
D 
” 


R 


x< 


The first operand (16 bits) with carry is added to the second 
operand and replaces the first operand. 


RX, RS! 


The second operand ts subtracted from the first operand and 
replaces the first operand. 


R 


x< 


The second operand (16 bits) with carry is subtracted from the 
first operand and replaces the first operand. 


> 


ND 


D 
x 
D 
ee 


The AND of the first operand and the second operand replaces 
the first operand. 


O 


R 
RR, RS, SS 


RX, RSI 





The OR of the first operand and the second operand replaces the 
first operand. 
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x< 


OR 


D 
x< 
pe) 
o) 


The logical difference of the first operand and the second operand 
replaces the first operand. 


RX, RS! 


a! 
Ey 
Le) 


The first operand and the second operand are logically ANDed. 


The contents of R; and X» are unchanged. 


COMPARE 


D 
x< 
D 
” 


BE 
EI 
Le) 


The first operand is algebraically compared with the second 
operand. The result Is indicated by the condition code. 


COMPARE LOGICAL 


The first operand is compared logically to the second operand. 
The result is indicated by the condition code. 


MULTIPLY 


R 


x< 


The first operand (R; + 1) is multiplied by the second operand 
and the 32-bit product is contained in R; and R; + 1 registers. R, 
must be an even address The sign of the product is determined 
by the rules of algebra. 


MULTIPLY UNSIGNED 


R 


x< 


The first operand (R; + 1) is multiplied by the second operand 
and the 32-bit product is contained in R,; and (R; + 1). Ry must be 
even. 


LOAD BYTE 


D 
x< 
D 
x 


’ 


The 8-bit byte stored in the low order byte of the second operand 
location is stored in the low order byte of R,. The high order byte 
of the R, is set to zero. 


Bs) 
x< 
D 
ao) 


’ 


The byte at the second operand location is loaded into the low 
order byte of R; without changing the contents of the high order 
byte of R;. 


D 
x 
D 
” 


The least significant byte of the first operand is stored in the 
location specified by the second operand. The other byte of the 
second location is unchanged. 





R 


x< 


The bytes specified by the first and second operands are ex- 
changed. When the operand specifies a register (i.e. R;, Ro) only 
the low order byte is exchanged. 


BYTE SWAP 
RR, RS 


pm | Re 
pa | ee 


into the register specified by the first operand. 


R 


i 


— 
= 
@ 
+ 
= 
.o) 
oy 
< 
> 
£4] 
” 
12) 
=~ 
~ 
a 
ca) 
n 
© 
2) 
Oo 
S 
Q. 
{°) 
Oo 
@ 
“~ 
ped) 
=) 
Q. 
0) 
“= 
@ 
” 
= 
pe) 
Ko) 
xe) 
© 
Qo 
el) 
=) 
a 
O 
i) 
a. 
oO 
Q 


COMPARE LOGICAL BYTE 


RR, RS 


The low order byte of the first and second operands are com- 
pared. The result is indicated in the condition code. 


Be) 
x< 
6 
” 


AND BYTE 


RX, RSI 


The AND of the low order bytes specified by the first second 
operands replace the first operand low order byte. The high order 
byte of R, ts set to zeros 


D 
< 
D 
2 


The OR of the low order bytes specified by the first and second 
operands replace the first operand low order byte. The high order 
byte of R; 1s set to zero. 


XOR BYTE 


RX, RSI 


The XOR of the low order bytes specified by the first and second 
operands replace the first operand low order byte. The high order 
byte of R; Is set to zero. 


LOAD PROGRAM STATUS WORD 


A 32-bit new PSW ts loaded from the memory location specified 
by the second operand as the current PSW. 


R 


x< 


EXCHANGE PROGRAM STATUS 


R) 
5 


fa" 
D 


PSW (0:15) —> ( 
R2 - PSW (0:1 


STORE PROGRAM STATUS WORD 


The 32-bit PSW is stored at the location specified by the second 
operand. 


R 


x< 
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SUPERVISOR CALL 


pa | me 


OLD PSW = | (Xz) + d] 
(Xo) + d] + 4 NEW PSW 


i 
x< 


—_—_ 


SET, CLR, COMPLEMENT, TEST BIT PSW 


The condition flags in the current PSW are set, cleared, com- 
plemented, or tested. N defines the bit(s) to be affected or tested. 


CALL 


| 

x< 

JD 

| i 


Jump to the memory location specified by the second operand 
and push PSW (16:31) onto stack. 


STACK > PSW (16:31) 


PUSH 





vu 
= 
Cc 
n 
xr 


Po 
EF 
ps) 


R, THRU Ry > STACK 


P/POP 


Ea 
Ear 
D 


STACK > R, THRU R> 


LOAD STACK POINTER 
LOAD STACK LIMIT LOWER 
LOAD STACK LIMIT UPPER 


STORE STACK POINTER 
STORE STACK LIMIT LOWER 
STORE STACK LIMIT UPPER 


The stack point, stack limit lower or upper is read from or written 
into the address defined by the second operand. 


DD 
D 


4 
Be) 
> 
Zz 
oD) 

a ; 
pall 
mM 

= 


R, 


LENGTH 


ADDRESS 


Ro 


ADDRESS 


The addresses specified by R; + 1 and Ro define two tables, R, 
+ 1 address is the top location of a table to be translated, Ro 
address the first location of the translation table. The value (one 
byte) pointed to be the R; + 1 address ts indexed by (added to) 
the address value of Re to find the translation code. This transla- 
tion code replaces the value pointed to by the R,; + 1 address. 
After one byte is translated, the length is decremented and the 
address of R; + 1 incremented and the instruction repeated, until 
the length equals zero. This instruction is interruptable. If this 
instruction ts interrupted, the PC is left pointing to this instruction 
so that this instruction can be resumed after the interrupt service 
is complete. 


TRANSLATE AND TEST 


RR 
or | | me 
Ry 






LENGTH 


ADDRESS 


Ro 





ADDRESS 


This instruction proceeds like translate except that the bytes of 
the first operand (defined by R,) are not changed in storage. 
When the bytes of the translate table (Rz) the instruction pro- 
ceeds to the next byte of the first operand. If the byte of the 
translate table is not zero, the instruction is halted with the ad- 
dress pointed to last in the translate table held in register 1. 


MOVE LONG 





Moves bytes defined by R; to Ra. Both adresses incremented 
after each transfer. This instruction is interruptable. 


COMPARE LONG 





Re 
ADDRESS 


Compares the first operand against the second operand. The 
length is decremented and the address incremented after each 
compare. When length = zero of the bytes compared are not 
equal, the instruction is halted. 


R 


x< 


The upper 16 bits of the instruction at the second operand is 
‘OR’ed with R, and executed. 


Nibbles in operand 1 and operand 2 are added. The result is 
placed in operand one. 


DECIMAL SUBTRACT 


Nibbles in operand 2 are subtracted from nibbles in operand 1 
and the result is placed in operand 1. 


DECREMENT INDEXES 


R,; —-1—> R, 
2-1-7>R, 


D 


One is subtracted from R, and the result placed back into R;. One 
is subtracted from Ro and the result placed back into Ro. R; and 
Ro may specify the same register with will effectively subtract two 
from that register. 


SHIFT RIGHT ARITHMETIC 
SHIFT RIGHT DOUBLE ARITHMETIC 


pa |e 


The contents of R,; for single shifts and R;, R; + 1 for double 
shifts are shifted the number of places specified by the second 
operand. The sign bit is unchanged. Bits shifted in are set equal to 
the sign bit. Bits shifted out are shifted through the carry bit. 


D 
x< 
pe) 
ee 


’ 
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ROTATE RIGHT 
ROTATE RIGHT DOUBLE 


pm | Pe 


The contents of R, for single shifts and R;, R; + 1 for double 
shifts are rotated right the number of places specified by the 
second operand. 


RX, RSI 


SHIFT LEFT ARITHMETIC 
SHIFT LEFT DOUBLE ARITHMETIC 


pa | me 


The contents of R, for single shifts and R; + 1 for double shifts 
are shifted left the number of places specified by the second 
operand. The high order bit (sign bit) of the register a register pair 
is unaffected by the shift. Low order bits are filled with zeros. If a 
bit unlike the sign bit is shifted out of the position adjacent to the 
sign bit, the overflow flag is set. 


RX, RSI 


ROTATE LEFT 
RX, RSI 


The contents of R,; for single shifts and R;, R; + 1 for double 
shifts are rotated left, the number of places specified by the 
second operand. 


SHIFT RIGHT LOGICAL 
SHIFT RIGHT DOUBLE LOGICAL 


pa [re 


The contents of R, for single shifts and R; + 1 for double shifts 
are shifted right the number of places specified by the second 
operand. High order bits shifted in are zeros, low order bits shifted 
out are shifted through the carry bit. 


D 
x< 
5s) 
O) 


’ 


SHIFT LEFT LOGICAL 
SHIFT LEFT DOUBLE LOGICAL 


RX, RSI 


poo fatal «| 


The contents of R, for single shifts and R;, R; + 1 for double 
shifts are shifted left the number of positions specified by the 
second operand. High order bits shifted out are shifted through 
the carry bit. Zeros are shifted in. R,; for double shifts must be 
even. 


R 


x< 


One 16-bit word of data is read into the first operand from the 
device which is addressed by the contents of the second 
operand. 


INPUT BYTE 


oo fa 


One byte of data is read into the low order 8 bits of the first 
operand from the device which is addressed by the contents of 
the second operand. 


2 |, 
D 


R 


x< 


EI 
Le) 


R 


x< 


The 16 bits of R; is sent to the device which is addressed by the 
contents of the second operand. 


R 


x< 


The low order 8 bits of R; is sent to the device which is addressed 
by the contents of the second operand. 


R 


WM 


RX 


39) 
Dv 
> 
Zz 
2) 
ae 


Unconditionally branch to the location specified by the second 
operand. The first operand is not used. 


BRANCH ON CONDITION 


RX 


Branch to the location specified by the second operand if the 
condition code specified in the first operand postion is equal to the 
current PSW status bits. 


Condition codes are: 


Carry =B (Sign=0) 

No Carry =A Minus =F 
Zero = 5 (Sign=1) 

Not Zero =4 1’s Comp> =9 
2's Comp> =0 1’s Comp< = 8 
2's Comp< = 3 1’s Comp> =C 
2's Comp> 2 1’s Comp< =D 
2’s Comp< = 1 Overflow =7 
Plus =E Not Overflow = 6 


BRANCH AND LINK 


BALR RS 
po pa 
BAL RX 


a 
nN 


The address of the next sequential instruction is saved in R;, and 
an unconditional branch to the jump address is taken. 


BRANCH ON INDEX 


BXH HIGH RX 
oo fafu] 8 | 
BXLE LOW OR EQUAL RX 


Bi 
Le) 


R, is incremented by the value in R, + 1, and logically compared 
to the index limit held in Ry +2. 


INDEX HIGH 
(Ry) + (Ry + 1) > (Ry) 


(Ry) : (Ry + 2) 
IF (R,) > (Ry + 2) THEN d + (X») > PSW (16:31) 


IF (Ry) < (Ry + 2) THEN PSW (16:31) + 2 > PSW (16:31) 
INDEX LOW OR EQUAL 


(Ry) + (Ry + 1) — (Ry) 
(Ry) : (Ry + 2) 
(R;) < (Ry + 2) THEN d + (X) > PSW (16:31) 


IF (Ry) > (Ry + 2) THEN PSW (16:31) + 2 — PSW (16:31) 
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PS by bo bd Pot fy Fd ty wo we we we we we wows 
NQNOohWNH ES 


Oo 
©@M 


R18 
R11 
R12 
R135 
R14 
R15 


x1 
X2 


X4 
X5 


X7 
X8 


X10 
X11 
x12 
X13 
X14 
x15 


~wewe we 


CY? 
NC? 


NZ? 
GT? 


GE? 
LE? 


MI? 
HI? 
LS? 
HS? 
LO? 
OV? 


tot we we we we we we we we we 


pw we we we 
is] 


© we we we zw ee we we GT) we wewe 


CQ we we we 


MERE Re Me He HME Me He Ne EM Me Me he Ne A eA Me ee a HH I He HI 


MACRO DEFINITIONS FOR MICRO/29 


HBR He EM Me ae eM Me ee he Ne eae ee ae a He he a a a ae eae Ae 
DEFINITIONS FOR CPU REGISTERS 


SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 18 
SET 11 
SET 12 
SET 13 
SET 14 
SET 15 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 
SET 


PRESET CONDITION CODES 


SET @BH ; CARRY 
SET OAH 3NO CARRY 


SET @5H 

SET 04H sNOT ZERO 

SET OOH 32°S COMP GREATER THAN 

SET 03H 32°S COMP. LESS THAN 

SET 62H ; 

SET 9018 ; 

SET OEH ; 

SET @FH ; 
; 
3 
’ 
; 
; 
; 


ODBNOOAANY@ 


ODNVAU FUN eE DB 


> 2 pn ee pe 
Or ANH & 


SET O98 
SET 8H 
SET @CH 
SET @DH 
SET O7H 


1 
1 
1 
1°S COMP. LOWER 
0 
SET O6H N 


LR LOAD REGISTER 
MACRO R1,R2 

DB 18H,R1*16H+R2 

ENDM 

AR ADD REGISTER 
MACRO R1,R2 

DB 1AH,R1*10H+R2 

ENDM 

SR SUBTRACT REGISTER 
MACRO R1,R2 

DB 1BH,R1*10H+R2 

ENDM 

NR AND REGISTERS 
MACRO R1,R2 

DB 14H,R1*10H+R2 

ENDM 

ORR OR REGISTERS 
MACRO R1,R2 

DB 16H,R1*10H+R2 

ENDM 

CLR COMPARE LOGICAL REGISTERS 


MACRO R1,R2 


°S COMP. LOWER OR SAME 
“S COMP. HIGHTER OR SAME 


18 


1A 


1B 


14 


16 


15 


APPENDIX B 


Pq we WO we 


we 


Yd we we we wewe we we 


O wewe we wwewe wo CN we we we be we we we CS we we we 


Zi wo we we Ct we we we we we 8 we we we CQ we we we 


we 


foe wo we we 
| come 


CT ewe wo 


DB 15H,R1*10H+R2 
ENDM 


XR EXCLUSIVE OR REGISTERS 17 
MACRO R1,R2 

DB 17H,R1*1@H+R2 

ENDM 


SSSSssSsssseeS Se SESS 52S 5555525 >= 


RX TYPE INSTRUCTIONS 


LD LOAD MEMORY 58 
MACRO R1,X2,DI 

DE SOR RETIN Re CR) SHR 8,(DI) AND OFFH 
ENDM 


ST STORE IN MEMORY 59 
MACRO R1,X2,DI 

DB 50H,R1*10H+X2,(DI) SHR 8,(DI) AND OFFH 
ENDM 

ADD ADD FROM MEMORY 5A 
MACRO R1,X2,DI 

DB 5AH,R1*10H+X2,(DI) SHR 8,(DI) AND OFFH 
ENDM 

SUB SUBTRACT FROM MEMORY 5B 


MACRO kR1,X2,DI 
DED asmi ra en rae (DE) SHR 8,(DI) AND OFFH 
E 


N AND WITH MEMORY 54 
MACRO Ri,X2,DI 

DB 54H,R1*10H+X2,(DI) SHR 8,(DI) AND OFFH 
ENDM 

0 OR WITH MEMORY 56 
MACRO R1,X2,DI 

DB 56H,R1*10H+X2,(DI) SHR 8,(DI) AND OFFH 
ENDM 

CMP COMPARE WITH MEMORY 55 
MACRO R1,X2,DI 


ee ee SHR 8,(DI) AND OFFH 
EN 


IMMEDIATE INSTRUCTIONS 
LI LOAD IMMEDIATE 41 
MACRO R1,12 
DB 41H,R1*10H,(I2) SHR 8,(I2) AND OFFH 
ENDM 
NI AND IMMEDIATE 94 
MACRO R1,12 
DB 94H,R1*10H,(I2) SHR 8,(I2) AND OFFH 
ENDM 
OI OR IMMEDIATE 96 
MACRO R1i,12 
DB 96H,R1*10H,(12) SHR 8,(12) AND OFFE 
ENDM 
XI EXCLUSIVE OR IMMEDIATE 97 
MACRO R1,12 
DB 97H,R1*10H,(I12) SHR 8,(12) AND OFFE 
ENDM 
Al ADD IMMEDIATE OA 
MACRO R1,12 
DB 9AH,R1*16H,(I2) SHR 8,(I2) AND OFFE 
ENDM 
SI SUBTRACT IMMEDIATE OB 
MACRO R1,12 


DB OBH,R1¥*10H,(I2) SHR 8,(12) AND OFFH 
ENDM 


CD Oe we we 


tr we we we we we we we we we 


to we we we 


Go we we we 


ad 
et 


tr we wo we we 


> 
te 
bo 


to we we we 


CT we we we we we wt we we we 
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; 
CI COMPARE IMMEDIATE 95 , SRL SHIFT RIGHT LOGICAL 88 
; 
MACRO R1,12 SRL MACRO R1,CT 
DB 95H,R1*1@H,(I2) SHR 8,(12) AND OFFH DB 88H ,R1*10H+(CT-1) 
ENDM ENDM 
; 
Sexson e ee seesesossc ces sSee Sse Sees SSS Se Ss ; SLL SHIFT LEFT LOGICAL 89 
’ 
BRANCH AND CONITIONAL BRANCH INSTRUCTIONS SLL MACRO R1,CT 
DB 89H ,R1*10K+(CT-1) 
’ 
BX UNCONDITIONAL BRANCH 74 ; SRA SHIFT RIGHT ARITHMETIC 8A 
; 
MACRO X1i,DI SRA MACRO R1,CT 
DB 74H,X1¥*10H,(DI) SHR 8,(DI) AND OFFE DB 8AH,R1*10H+(CT-1) 
ENDM ENDM 
’ 
BC CONDTIONAL BRANCH 47 i RRL ROTATE RIGHT AB 
’ 
MACRO CC.X2.DI RRL MACRO R1,CT 
DB 47H, CC#19H+X2, (DI) SHR 8,(DI) AND OFFE DB SASH ,R1*10H+(CT-1) 
ENDM ENDM 
, 
BAL BRANCH AND LINK 45 i RLL ROTATE LEFT AA 
J 
MACRO R1,X2.DI RLL MACRO R1,CT 
DB 45H,R1*10H+X2,(DI) SHR 8,(DI) AND OFFE DB OAAH,R1*10H+(CT-1) 
ENDM ENDM 
’ 
; SSS os ss SSeS SSFS SSS SSE S SSS seze=>SeSE= 
BALR BRANCH AND LINK REGISTER 65 ; 
3 I/O INSTRUCTIONS 
MACRO Ri,R2 i 
DB 05H ,R1*10H+R2 i Sesssssrtssoessresesssescesseozsessqaz= 
ENDM ’ 
; IN INPUT AS 
BR BRANCH REGISTER UNCONDITONAL 04 ; 
IN MACRO R1,X2,DI 
MACRO R1 DB @AGH,R1*1@H+X2,(DI) SHR 8,(DI) AND OFFH 
DB 04H,R1*10H p ENDM 
ENDM ’ 
; OUT OUTPUT A2 
SEC ecrc se SSSSS SEF CSS STEVES SES SS SSS TSESSSs= ’ 
OUT MACRO R1,X2,DI 
SHIFT AND ROTATE INSTRUCTIONS DB OA2H,R1*1O0H+X2,(DI) SHR 8,(DI) AND OFFE 
ENDM 
SSS rc Sree Ss SS SS SSS SSSS5 SSS SSS SS 555555555 ’ 
SLA SHIFT LEFT ARITHMETIC 8B 


MACRO R1,CT 
DB 8BH,Ri*1@H+(CT~-1) 
ENDM 
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FFFA 
FFES 
GAGSD 


2280 


2800+1788 

0802+45E80028 
COBE+45EO0016 
OBDA+45E 20040 
OBBE+45E0098C 
@812+74880006 


O816+5QESOSEA 
OG1A+4120003E 
901 E+45E003D8 
O022+58EOO3SEA 
O826+04E8 


@028+5BEBO3EC 
@820+41200A0D 
0830+45E003D8 
0034+8827 

0036+45 EOO3D6 
OB3SA+SBESOSEC 
OO3E+O4EO 


0040+50EQOQ5EE 
0044441300405 
0048+41400080 
0640+45E00398 
@8950+8917 

6052+1851 

0054+16FF 

0856447580082 
885A+45E08398 
OO5E+941000FF 
@862+1651 

O064+16FF 

0066+47500082 
986A+595350000 
O86E+9A380002 
007 2+9B400082 
OO764+4750007E 
087 A+7408004C 
O87E+41508D00 
0882+59530080 
0086+ 58EOO3EE 
OO8A+O4E0 


OOB8C+SCECOSFS 
089 0+4140040E 
9094+58540000 
8898+55580408 


889C0+475088C2 
@G9A8+5550040A 
0@44+475001B0 
@@A8+5558046C 
@BAC+475002F2 
09 B0+45E80828 
O9B4+4120023F 
OOBB+45E003D8 
@OBC+SSEOOSFO 


$*MACRO 


we we wee we ee we 


; 

DATA: 
STATUS: 
CRLF: 

’ 


; 
BEGIN: 


MONLP: 


; 
PROMPT: 


; 
DOCRLF: 


° 
9 


GETIP: 


IPLP: 


DOEOF2: 
DOEOF: 


, 
SCANER: 


BY: 


MACLIB 
EQU 
EQU 
EQU 
ORG 
XR 
BAL 
BAL 
BAL 
BAL 
BX 


LI 
BAL 
LD 
BR 


LI 
BAL 
SRL 
BAL 
LD 
BR 


ST 
LI 
LI 
BAL 
SLL 
LR 
ORR 
BC 
BAL 
NI 
ORR 
ORR 
BC 
ST 
Al 
SI 
BC 
BX 


st 
LD 
BR 


ST 
LI 
LD 
CMP 
BC 


CMP 
BC 
CMP 
BC 
BAL 
LI 
BAL 
LD 
BR 


00CO+B4EG 
SIMPLE MONITOR FOR THE AMD BIGH-SPEED 16-BIT COMPUTER 
00C2+41480418 
JIM BRICK 
@0C6+45E00304 
MICRO29 QOCA+9460FFFO 
@FFFAH ;USART DATA PORT ADDRESS QOCE+41080010 
@FFFYBH ;USART CONTROL PORT ADDRESS 
G@AODE ;LINE-FEED, CARRIAGE RETURN 90D2+506804064 
@ @OD6+45 500110 
RO,RO SCLEAR RO GODA+41200020 
R14,R@,DOCRLF NEW LINE ON CONSOLE @ODE+45EGO3D8 
R14,R0,PROMPT  31/P PROMPT Q0E2+45E003D8 
R14,RO,GETIP $GET USERS I/P @0E6+45E00126 
R14,R@,SCANER  $DECODE & EXECUTE COMMAND @OEA+41200020 
RO, MONLP ;REPEAT LOOP FOREVER QOEE+45EO03D8 
QOF2+45E@015C 
R1i4,R@,@PROMPT SAVE RET 
. oe QOF6+45500028 
R2,’> ;PROMPT CHARACTER ‘> 
OOF A+58600404 
R14,RO,CRTOUT jPROMPT TO CRT 
QOFE+94600010 
Ri4,R@,@PROMPT jRESTORE RET 
@162+9BC 80001 
R14 ; 
0106+4740¢0D2 
R14,R8,@DOCRLF ;SAVE RET GISAF5QEQVSFO 
R2,CRLF i3CR/LF CODES O1GE+O4ER 
R14,R@,CRTOUT  30/P LF 
0110+50E003F2 
R2,8 3GET CR CODE 
0114+A867 
R14,R@,CRTOUT  30/P CR 
0116+45B0035E 
R14,R@,Q@DOCRLF RESTORE RET 
: 011A+8867 
R14 ; 
0110+45EQ035E 
R14,R0,Q@GETIP  $SAVE RET ereesnereore 
4+ 
R3, BUFFER ;A(I/P BUFFER) aheeeee 
R4, OBER ;MAX I/P COUNT 0126+50R003F4 
R14,RO,CETCHR jGET NEXT I1/P CHARACTER 012A+58700404 
R1,8 SPOSITION I/P CHAR TO HI BYTE 012E+41De0008 
R15,R15 sTEST RET CODE 0136+A867 
27,RO,DOROF ;T0 EOF IF RC = ZERO 0138+4550035E 
R14,RO,GETCAR s$NEXT CHARACTER @i3¢*6a67 
R1,00FFB ;SAVE ONLY I/P CHARACTER IN LO 0135+45EG235E 
R5,R1 ;COMBINE TWO BYTES FOR ONE WORD 0142+41200020 
R15,R15 $TEST RET CODE 0146+45Ee03D8 
2? ,R@,DOEOF ;T0 EOP IF RC = ZERO 014A+9a7 90002 
R5,R3,0 ;DATA TO I/P BUFFER 145+9BDG0001 
R3, 9082 $T0 NEXT BUFER SLOT aisssavaodise 
R4 , 0082 ;COUNT-2 0156+Senee3F4 
2Z?,R@ ,DOEOF2 ;STOP IF MAX I/P 015A+04E0 
Ra, IPLP SCONTINUE GETTING I/P 
R5,OD00H ;EOF APTER MAX LINE @15C+50EOOSF6 
R5,R3,0 ;DATA/EOF TO BUFFER 0160+56700404 
R14,R0,@GETIP j;RESTORE RET 0164+41D00006 
aa ; 8168+58670000 
@16C+A867 
R14,R@,@SCANER SAVE RET aiaekines 
R4, BUFFER 3A(I/P BUFFER) sioarasxeaicd 
R5, R40 3GET COMMAND (FIXED FORMAT) aa wena panenk 
R5,R@, DMPCMD ; D FOR DUMP? “isexeces 
27,R@, DUMP 3CO IF TRUE gupasiese 
R5,R@,STRCMD ; S FOR STORE? 617C+45E80196 
Z22,RO, STORE 3GO STORE IF TRUE @182+45E003D8 
R5,R@, JMPCMD 3 J FOR JUMP? 0184+94700002 
2?,R@,JUMP $0 JUMP IF TRUE 0188+9BD00001 
R14,R0,DOCRLF NEW LINE ON CRT @180+47480168 
R2,°?7° 3 2? 190+ S8EQO3FE 
R1i4,RO,CRTOUT UNKNOWN COMMAND 819440450 
R14,R@,@SCANER $RESTORE RET 
@196+942000FF 
R14 : - 


; 
DUMP: 


DMPLP: 


; 
TYPAD: 


; 
DMPOUT: 


DMPLPP: 


; 
TYPLIT: 


TYPLLP: 


; 
pOoCcIT: 


LI 
BAL 
NI 
ul 
st 
BAL 
LI 
BaL 
BAL 
BAL 
LI 
BAL 
BAL 
BAL 
LD 
Al 
SI 
BC 
LD 
BR 


RRL 
BAL 
SRL 
BAL 
LD 
BR 


LD 
LI 


RRL 
BAL 
SRL 
BAL 
LI 
BAL 
Al 
SI 
BC 
LD 
BR 


LD 
LI 
LD 
RRL 
LR 
BAL 
BAL 
SRL 
LR 
BAL 
BAL 
Al 
SI 
BC 
LD 
BR 


NI 
CI 


R4, BUFOP1 

R14 ,R@,CVADDR 
R6, OFFFOR 

R12 ,16 
R6,RO,DMPAD 
R14,R0,TYPAD 
R2,° ° jSPACE 
R14 ,RO,CRTOUT 
R14,R8,CRTOUT 
R14 ,R@,DMPOUT 
R2,° ° $SPACE 
R14,R@,CRTOUT 
R14,RO,TYPLIT 
Ri4,R@,DOCRLF 
R6, RO, DMPAD 
R6,16 

R12,1 

NZ? ,R@,DMPLP 
R14,R0,@SCANER 
R14 


R14 ,RO,@TYPAD 
R6,8 

R14,RG, BINOUT 
R6,8 

R14 ,RO,BINOUT 
R14,R@,@TYPAD 
R14 


R14 ,R@,@DMPOUT 
R7,R@,DMPAD 
R13,8 

R6 R72 

R6,8 
R14,R@,BINOUT 
R6,8 
R14,R0,BINOUT 
R2,° ° $SPACE 
R14,R0,CRTOUT 
R?7, 0862 

R13, 0061 

NZ? ,R@,DMPLPP 
R14,R0,@DMPOUT 
R14 


R14,R@,@TYPLIT 
R7,R@, DMPAD 
R13 ,8 

R6,R?,@ 

R6,8 

R2 ,RE 
R14,R0,DOCIT 
R14,R@,CRTOUT 
R6,8 

R2,R6 
R14,R6,DOCIT 
R14 ,RO,CRTOUT 
R7 0882 

R13,1 
N2?,R@,TYPLLP 
R14,R0,GTYPLIT 
R14 


R2,QOFFH 


° 


R2,° 


jA(ADDRESS PORTION OF BUFFER) 
sASCII ADDRESS TO BINARY IN R26 
;BEGIN ON EVEN WORD BOUNDRY 
30/P LINE COUNT 

SAVE CURRENT O/P ADDRESS 
;TYPE CURRENT CONTENTS OF RE 


;TO CRT 
$2 SPACES 
$PUT OUT ONE LINE OF DUMP DATA 


;TO CRT 

90/P LITERAL DATA 

sNEW LINE ON CRT 
+CURRENT DUMPOUT ADDRESS 
sADDRESS NEXT LINE 

+LINE COUNT ~1 

y;LOOP THRU O/P DATA 
sRESTORE RET 


. 
’ 


+SAVE RET 

;HI ADDRESS BYTE 
30/P 

*LO ADDRESS BYTE 
30/P 

*+RSTORE RET 


° 
’ 


sSAVE RET 

sGET O/P DATA ADDRESS 
30/P WORD COUNT 

sGET NEXT WORD 

sHI BYTE FIRST 

30/P 

3LO BYTE 

30/P 


>TO CRT 

>BUMP I/P DATA ADDRESS 
sWORD COUNT -1 

sLOOP THRU LINE 
sRESTORE RET 


SAVE RET 

3GET O/P DATA ADDRESS 

sWORD COUNT 

sNEXT O/P WORD 

sHI BYTE FIRST 

3TO O/P REG 

;CHECK FOR PRINTABLE CHARACTER 
3TO CRT 

3+GET LO BYTE 

TO O/P REG 

;CHECK FOR PRINTABLE CHARACTER 
3TO CRT 

3TO NEXT WORD 

sWORD COUNT ~1 

sLOOP THRU O/P LINE 

+RESTORE RET 


sGET LOW BYTE 
+BELOW BLANK? 


@19A+95290628 
Q19E+473001AA 
O1A2+9520007F 
O1AG+473EORBB 
@1AA+41200024E 
Q1AE+O4E0 


Q1B0+41400418 
81B4+45E 08304 
@1B&+58400406 
@1BC+17DD 

@1BE+45E001E6 
@1C2+58EOOSFO 
@106+9559680eD 
O1CA+475E0000 
O1CE+45EQ001FA 
@1D2+58EQ0SFO 
@1D6+9558888D 
@1DA+475 E0080 
Q@1DE+9A408802 
O1E2+740001BE 


O1E6+5OEOO3FA 
61EA+58540808 
@1EE+8857 

O1FV+45EG0218 
DIPS+5BEQOSFA 


O1FS+O4E9 


B1FA+S@EOOSFA 
O1FE+58540002 
@202+945200FF 
8206+45E00210 
O2CA+SEEOQSFA 
B2QE+B4ER 


@210+58EOQ3FC 
8214+45ES6235C 
0218+58 EGOSFC 
@210+475 E0808 
O220+45EBO25A 
8224447380232 
0228+45 E08 298 
022C+58EGO3SFC 
9230+0450 

8232+4158908D 
9236+ S8EGOO3FC 
O@23A+94ED 


8230+95590028 
0240+475E0800 
@244+9550002E 
6248+475 E8008 
624C+9558882C 


8250+475E6880 
8254+9558000D 
0258+84E9 


O25A+945000FF 
@25E+95588838 
@262+473 E2008 
0266+9558003A 
@26A+47388268 
O26 E+95500041 
O272+47SEQOO8 
0276+95588047 
027 A+47308284 


O27E+95S5OFFIF 
O282+04E0 


SETPER: 


Uwe 


TORE: 


STLP: 


; 
UPSTOR: 


5 
LOSTOR: 


5 
STDATA: 


SETND: 


’ 
CKDEL: 


; 
ASCHEX: 


VALPH: 


BC 
cI 
BC 


BR 


LI 
BAL 
LD 


XR 


BAL 


LD 
cI 
BC 
BAL 
LD 
cI 
BC 
Al 
BX 


st 
LD 
SRL 
BAL 
LD 
BR 


st 
LD 
NI 
BAL 
LD 
BR 


st 
BAL 
LD 
BC 
BAL 
BC 
BAL 
LD 
BR 
LI 
LD 
BR 


BC 
CI 
BC 
cI 
BC 
cI 
BR 


NI 
CI 
BC 
cI 
BC 
CI 
BC 
cI 
BC 
CI 
BR 
SI 


LT? ,RO,SETPER 
R2,007FH 
LT?,R14,0 
R257." 

R14 


R4, BUFOP1 

R14 ,R@,CVADDR 
R4,RG,DATAD 
R13,R13 
R14,R@,UPSTOR 
R14 ,RO,@SCANER 
RS, Q@ODH 
2?,R14,0 

R14 ,R@,LOSTOR 
R14,RO,@SCANER 
R5, OOODH 
27,R14,8 

R4, 0002 

Ro, STLP 


R14,R@,GUPSTOR 
R5 ,R4,8 

R5,8 
R14,R@,STDATA 
R14 ,R@,@UPSTOR 
R14 


R14,R@,G@UPSTOR 
R5,R4,0 
R5,OOFFH 
R14,R@,STDATA 
R14,R@,@UPSTOR 
R14 


Ri4,R@,@STDATA 
R14,R0,CKDEL 
R1i4,R@,@STDATA 
22,R14,8 
R14,RO,ASCHEX 
LT?,RO, SETND 
R14,R0,NIBBLE 
R1i4,R@,@STDATA 
R14 

R5, O@ODH 
R14,R0,@STDATA 
R14 


o 


R5,” 
27,R14,@ 
R5,°.° 
27, R14, 
RS,’,” 
2?,R14,0 
R5, O@ODH 
R14 


R5,@OFFH 
R5, O° 
LT?,R14,8 
R5,°:" 
LT?,R@,VNUM 
RS, ‘A* 

LT? ,R14,2 
R5,6047H 
LT? ,R@,VALPH 
R5,OFFFFH 
R14 

R5 ,8807H 


+SET PERIOD IF TRUE 

+BELOW DEL? 

;RET IF TRUE (CHAR PRINTABLE) 
+SET PERIOD AS CHARACTER TO PRI 


. 
y 


sACADDRESS FIELD) 

ASCII ADDRESS TO BINARY (IN R6 
GET CURRENT I/P DATA ADDRESS 
;CLEAR NIBBLE COUNT REG 
sUPPER BYTE FIRST 

sGET RET 

sEND? (CR = END) 

sRET IF TRUE 

sLOWER BYTE 

+GET RET 

yvEND? 

+RET IF TRUE 

3TO NEXT WORD 

sCONTINUE STORING DATA 


sSAVE RET 

sGET NEXT DATA 
sGET HI BYTE 
330 STORE BYTE 
sRESTORE RET 


sSAVE RET 

iGET DATA 
3KEEP LOW BYTE 
G0 STORE BYTE 
s+RESTORE RET 


. 
’ 


sSAVE RET 

sCHECK FOR DELIMITER 
+GET RET 

sRET IF RC = @ 
sASCII BYTE TO HEX NIBBLE 
5.NZ. RC = END 
+STORE THIS NIEBBLE 
sRESTORE RET 

; 

> FAKE EOF 

sRESTORE RET 


sSPACE? 

+RET IF TRUE 

> PERIOD? 

;RET IF TRUE 

3 COMMA? 

sRET IF TRUE 
;CARRIAGE RET? 
sLET CALLER DECIDE 


;LOW BYTE ONLY 
;LOWER THAN “2° ? 
;RET IF TRUE 

30-9 2 

;NUMERICAL IF TRUE 
LOWER THAN ‘A’ ? 
;RET IF TRUE 


+HEX ALPHA? 

+HEX ALPHA IF TRUE 
sSET .LT. CC 

; 


sASCII ADJUST 


€284+9B580087 
0288+9450000F 
@28C0+1555 
O28E+O4ER 


0292+58760008 
2294+16DD 
@296+474662AC 
@29A+9ADB0B01 
B29E+895B 
O2AG+947OOFFF 
@2A4+1675 
@2A6+58768900 
OZAA+O4ED 


@2AC+95D88001 
@2B0+474902C06 
@2B4+SADO00G1 
@2B8+6957 
O2BAt+9470F OFF 
@2BE+1675 
@208+58766008 
0204+84E8 


@2C 6+S5D80802 


O2CA+474002EO 
O2CE+9ADG0901 
@2D2+8955 
@20D4+947OFFOF 
@2D8+1675 
O2DA+507680988 
O2DE+O4EG 


O2EG6+17DD 
@2E2+9470FFFS 
@2E6+1675 
Q@2E8+56760008 
@2EC+9A680002 
O2FH+04EG 


O2F2+41400418 
O2F6+45EGO304 
O2FA+18F6 

B2FC+45EF GOOG 
@388+74806888 


@304+58E00482 
8388+1766 
@50A+58540080 
930E+8857 
@319+45E@G25A 
9314447488358 
@318+1665 
031 A+58540000 


O31E+45EG025A 
0322447499550 
@326+8963 
@326+1665 
O@32A+9A400802 
@S52E+58540088 
@332+8857 
0334+45EG0025A 
8338+47408354 
933C+8963 
O33E+1665 
0348+58540068 
O344+45E0025A 
0346+474086558 
834C+8963 


VNUM: 


; 
NIBBLE: 


, 
NXNIB1: 


; 
NXNIB2: 


; 
NXNIBS: 


, 
JUMP: 


’ 
CVADDR: 


NI 
CLR 
BR 


LD 
ORR 
BC 
Al 
SLL 
NI 
ORR 
st 
BR 


BC 
Al 
SLL 
NI 
ORR 
st 
BR 


BC 


Al 
SLL 
NI 
ORR 
sT 
BR 


NI 
ORR 
st 
Al 
BR 


LI 
BAL 
LR 
BAL 
BK 


ST 
XR 
LD 
SRL 
BAL 
BC 
ORR 
LD 


BAL 
BC 
SLL 
ORR 
Al 
LD 
SRL 
BAL 
BC 
SLL 
ORR 
LD 
BAL 
BC 
SLL 
ORR 


R5 , OOOFH 
R5,R5 
R14 


R7,R6,8 
R13,R13 

NZ? ,RO,NXNIB1 
R13 ,9891 
R5,12 

RV, OFFFH 
R7,R5 

R7,R6,@ 

R14 


R13 6001 

NZ? ,RO NXNIBZ 
R13 O21 

R5,8 

R?, OFOFFE 

R?7, RS 

R7,R6,@ 

14 


R13 ,082 
NZ?7,RO,NXINIBS 


R13 , 8801 
R5,4 
R7, OF FOFR 
R7,R5 
R7,RE,@ 
R14 


R13,R13 
R7, OFFFOH 
R7,R5 
R7,R6,8 
R6 , 8882 
R14 


R4, BUFOP1 
R14,R@,CVADDR 
R15,R6 
R14,R15,2 

RO, BEGIN 


R14 ,RO,@CVADDR 
R6 , R6 

R5,R4,0 

R5,8 

R14,R@, ASCHEX 
N2?,R@,CVHOUT 
R6,R5 

R5,R4,0 

R14 ,RG,ASCHEX 
NZ? ,RO ,CVHOUT 
R6,4 

R6,R5 

R4, 0802 
R5,R4,0 

R5,8 
R14,R@,ASCBEX ~ 
NZ?,R®,CVHOT1 
R6,4 

R6,R5 

R5,R4,0 
R14,R@,ASCHEX 
NZ?,R@,CVBOUT 
R64 

R6, RS 
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;LOW NIBBLE ONLY 
sRC = ZERO 


. 
’ 


sGET OLD DATA 

3R13 = ZERO 7? 

3TEST FOR ONE IF NOT TRUE 

;BUMP NIBBLE COUNTER 

;POSITION THIS NIBBLE 

;PREPARE OLD DATA FOR NEW NIBBL 
; INSERT NEW NIBBLE 

;DATA BACK TO MEMORY 


. 
? 


jNEXT NIBBLE? 

sTO NEXT IF NOT THIS 

;BUMP NIBBLE COUNTER 

;POSITION THIS NIBBLE 

;PREPARE OLD DATA FOR NEW NIBBL 
sINSERT NEW NIBBLE 

>DATA BACK TO MEMORY 


sNEXT NIBBLE? 
3TO NEXT IF NOT THIS 


3 BUMP NIBBLE COUNT 

sPOSITION THIS NIBBLE 

>PREPARE OLD DATA FOR NEW NIBBL 
sINSERT NEW NIBBLE 

sDATA BACK TO MEMORY 


° 
’ 


3LAST NIBBLE (LSN) 

;PREPARE OLD DATA FOR NEW NIBBL 
;INSERT NEW NIBBLE 

;DATA BACK TO MEMORY 

;BUMP MEM POINTER 


. 
’ 


3A (ADDRESS) 
sASCII ADDRESS TO BINARY ADDRES 
sADDRESS TO R15 


> JUMP... 


+BACK TO MONITOR IF CALLEE RETU 


+SAVE RET 

sCLEAR R6 

3GET TWO ADDRESS BYTES 
;UPPER BYTE FIRST 

sASCII BYTE TO HEX NIBBLE 
;STOP IF NOT HEX DATA 
+FIRST ADDRESS NIBBLE TO R6 
*GET ADDRESS BYTES AGAIN 


sASCII BYTE TO HEX NIBBLE 

sSTOP IF NOT HEX DATA 

sPOSITION ADDRESS FOR NEXT NIBB 
sINSERT NEXT ADDRESS NIBBLE 
>BUMP MEMORY PTR TO NEXT WORD 
sNEXT ASCII ADDRESS DATA 

sHIGH BYTE FIRST 

sASCII BYTE TO HEX NIBBLE 

sSTOP IF NOT HEX DATA 

sPOSITION ADDRESS FOR NEXT NIBB 
sINSERT NEXT NIBBLE 

3GET ADDRESS DATA AGAIN 

sASCII BYTE TO HEX NIBBLE 

s3STOP IF NOT HEX DATA 

sPOSITION ADDRESS FOR NEXT NIBB 
sINSERT NEXT NIBBLE 
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834E+1665 
Q358+9A400802 
8354450400466 
@358+58E20482 
@35C0+84E8 


Q39E+SGEQO3FE 
0362+1826 
0364+8823 
9366+9420000F 
@36A+45E 00386 
O36E+45E003D8 
037241826 
0374+9420000F 
0378+45E0038E 
037C+45EQO3D8 
0380+58E003FE 
0384+04E8 


Q38E+9528800A 
Q@38A+47FOO392 
Q3BE+GA2ZBB007 
@392+9A200830 


@396+04E0 


0398 +50E90400 
B39C+AG1 OF FFB 
0340494100082 
93A4+4758839C 
OSAS+AD1 OFFA 
B3AC+9410087F 
83B0+1821 

OSB2+45E0O3SD8 
Q3BE+1812 

O3B8+58E00400 
OSBC+41FQFFFF 


CVHOUT: 
CVHOT1: 


; 
BINOUT: 


; 
HEXEX: 


CON: 


’ 
GETCHR: 
RDCHR: 


AI 
ST 
LD 
BR 


ST 
LR 
SRL 
NI 
BAL 
BAL 
LR 
NI 
BAL 
BAL 
LD 
ER 


BC 
Al 
Al 
BR 


st 
IN 
NI 
BC 
IN 
NI 
LR 
BAL 
LR 
LD 
LI 
LI 


R4, 0802 
R4,R@,DATAD 
R14,R@,@CVADDR 
R14 


R14,RZ,@BINOUT 
R2,R6 

R2,4 

R2,OOGTH 
R14,R0,HEXEX 
R14,R@,CRTOUT 
R2,R6 

R2, OQOFH 
R14,RO,HEXEX 
R14,RG,CRTOUT 
R14,RO,@BINOUT 
R14 


R2,00@AH 
MI?,R0,CON 
R2 ,0007H 
R2,O@30H 


R14 


R14 ,R@,@GETCHR 
R1,R@ , STATUS 
R1 , 0802 
2?,R@,RDCHR 
R1,RO,DATA 

R1 ,007FH 

R2,R1 
R14,R@,CRTOUT 
R1,R2 
R14,R0,@GETCHR 
R15,-1 

R2, 00AH 


7TO NEXT MEMORY WORD 
>SAVE AS DATA ADDRESS 
sRESTORE RET 


sSAVE RET 

30/P BYTE TO R2 

sUPPEK NIBBLE FIRST 

sKEEP ONLY GOOD DATA 

+BINARY NIBBLE TO ASCII BYTE 
sNIBBLE (BYTE) OUT TO CRT 
30/P DATA TO R2 

+KEEP ONLY LOW NIBBLE 
+BINARY NIBBLE TO ASCII BYTE 
sAND OUT TO CRT 

s+RESTORE RET 


s;A-F ? 

;BR IF NOT TRUE 
sADJUST FOR A-F 
sMAKE ASCII 


. 
’ 


sSAVE RET 

>STRIP PARITY 

31/P READY? 

sLOOP UNTIL CHARACTER READY 
+READ DATA 

sKEEP ONLY DATA BYTE 
+DATA TO R2 

ECHO I/P 

+DATA BACK TO Ri 

+GET RET 

+SET R15 .NZ. 

+LF CODE IN CASE OF CR 


Q3C2+4120000A 
0304+9518608D 
23C5+474EQ O00 
@3CC+45E063D8 


@3D0+17FF 
@3D2+55ES 0400 
O3D6+O04EO 


@3D8+AG1GFFFB 
03DC+94108681 
O3EO+475863D8 
@SE4+A220FFFA 


O3E8+24EO 


OSEA 
O3EC 
@3EE 
O3FY 
@3F2 
O3F4 
@3F6 
O3F8 
OSFA 
@3FC 
O3FE 
0402 
0462 


0404 
8406 


04908 4428 
Q40A 5320 
@40C 4A20 


D40E 
04108 


2498 


@25A ASCHEX 
023C CKDEL 
@254 CVHOT1 
0408 DMPCMD 
0@28 DOCRLF 
0040 GETIP 
O1FA LOSTOR 
@2E@ NXNIBS 
Q1AA SETPER 
046A STRCMD 
@284 VALPH 
@3EC @LOCRLF 
O3FC @STDATA 


CRTOUT: 


? 
@PROMPT 
@DOCRLF 
@GETIP 
@SCANER 
@TYPAD 
@DMPOUT 
CTYPLIT 
@STORE 
GUPSTORK 
@STDATA 
@BINOUT 
@GETCHR 
@CVADDR 


, 
DMPAD: 
DATAD: 


, 

DMPCMD: 
STRCMD: 
JMPCMD: 


, 


; 
BUFFER: 
BUFOP1: 


cI 
BC 
BAL 
XR 


LD 
BR 


NI 
BC 
OUT 
BR 


DS 
DS 


DS 
DS 
DS 
DS 
DS 
DS 
DS 
DS 
DS 
DS 


DS 
DS 


DB 


DB 
DB 


DS 
DS 


END 


@6@2 BEGIN 
@392 CON 


8358 CVHOUT 
20D2 DMPLP 
@@82 DOEOF 


0386 HEX EX 
3806 MONLP 


8016 PROMPT 
FFFB STATUS 
9110 TYPAD 


8288 VNUM 


6402 @GGETCHR 
O3F8 @STORE 


R1,2@0DH 
N2?7,R14,8 
R14,R0,CRTOUT 
R15,R15 
R14,R@,@cETCER 
R14 


R1,RQ,STATUS 
R1 0001 

27, RO,CRTOUT 
R2,RO,DATA 
R14 


nn NNNNNNANNNNNN DA 


@35E BINOUT 
@AOD CRLF 
FFFA DATA 
@132 DMPLPP 
O@7E DOEOF2 
@84C IPLP 
8299 NIBBLE 
@39C RDCHR 
@219 STDATA 
615C TYPLIT 
O3FE @BINOUT 
OSEE CGETIP 
O3F2 @TYPAD 


+DATA = CR 2? 

sRET IF NO 

+DO LF IF PREVOIUS WAS CR 
sSET AC = ZERO FOR CR 
sRESTORE RETURN 


sGET STATUS BYTE 

sXMITTER EMPTY? 

sWAIT FOR XMITTER TO EMPTY 
30/P DATA TO CRT 


@40E BUFFER 
@3D8 CRTOUT 


0412 BUFOP1 
@304 CVADDR 


@406 DATAD 0404 DMPAD 
8126 DMPOUT @196 DOCIT 
@BC2 DUMP 8398 GETCHR 
@40C JMPCMD @2F2 JUMP 
@2aC NXNIBi @2C6 NXNIB2 
@88C SCANER 6232 SETND 
@1BE STLP @1B@ STORE 


8168 TYPLLP 
402 @CVADDR 
@3EA GPROMPT 
®3F6 @TYPLIT 


@1E6 UPSTOR 

O3F4 @DMPOUT 
@3F@ @SCANER 
O3FA GUPSTOR 
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APPENDIX D 


The System 29 operating system manages two floppy disk drives, 
AandB. The system will prompt with a A> or B> depending upon 
which disk the operator selects as the default. Generally, most 
system programs (editors, debuggers, compilers, etc.) are on the 


A disk and most user generated programs (source programs, 
user libraries, special assemblers, etc.) are on the B disk. In the 
following session, lower case letters are what the user typed-in, 
upper case letters are what System 29 responded, and com- 
ments (added as a tutorial) are in curly brackets. 


A>ed b: amd16bit.asm 

*e 

A>b: 

B>mac29 amd16bit $ab hb pb sb 


ASM29 VER. 1 0 
0490 

03BH USE FACTOR 
END OF ASSEMBLY 
B>a: 

A>ddt29 he 

A>set pa 3d 

A>ddt 

DDT VERS 1.4 
#ib:amd16bit.hex 
#18000 

NEXT PC END 

840E 0100 577F 
#TC 

A>\|bpm m29 wes cl ul de 1 
LOADING: M29.OBJ 


{call the editor to edit AMD16BIT.ASM from the B disk} 

{any program additions, changes, and/or deletions go here} 
{exit the editor and save the new AMD16BIT.ASM on the B disk} 
{switch to the B disk as default} 


{use the modified macro assembler (MAC29) to assemble AMD16BIT.ASM and put the 
HEX, PRINT and SYMBOL files back on to the B disk} 


{switch back to the A disk} 

{run DDT29, Halt the 16-bit computer's clock and Exit DDT29} 
{set the page register bit to enable the 16-bit computer main memory as 9080 upper 32k} 
{load 9080 DDT} 


» 


{reference the simple monitor's HEX file on the B disk} 
{read AMD16BIT.HEX into 9080 memory beginning at location 8000 HEX (upper 32k)} 


{exit DDT via control-C} 
{load the 16-bit computer’s microcode (phase-1 instruction set) } 


TITLE: MICROPROGRAM FOR 16-BIT COMPUTER 


VERIFYING: M29.OBJ 


TITLE: MICROPROGRAM FOR 16-BIT COMPUTER 


VERIFY COMPLETE 
A>ddt29 ir Ojr 


{run DDT29, set the instruction address register to zero (IR 0), jam the address on to the 
microprogram address bus (J), and run the 16-bit computer's clock (R)} 


set to 9600 baud and connected to console USART can now 
exercise the simple monitor. 


At this point, the AMD 16-bit high speed computer is running 
phase 1 instruction set in microcode and the simple monitor in 
target machine language in 16-bit main memory. A CRT terminal 
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APPENDIX E 


Memory Board 


The 16-Bit Computer Main Memory board was organized with 8k 
by 16-bit RAM section and a 2k by 16-bit ROM section. The RAM 
section occupies address 0-8k while the ROMs are assigned 
addresses 8k through 10k. The memory word consists of two 
bytes. The least significant address line specified whether high or 
low byte but is not used in the word mode. The address value from 
the computer is captured in a register at the beginning of the 
cycle; however, the most significant address lines are routed 
straight from the bus to the clock decode logic to make an early 
decision as to whether the memory board has been selected. 


In the word mode, the read and write transfers are straight for- 
ward. For the byte read mode, data is output on bus bits BDo.7 
while BDg.;5 are forced to zero. During byte write mode bus bits 
BDo.7 are duplicated internally on lines Do.7 and lines Dg_145. The 
signals WRHIGH or WRLOW select which byte in the RAM 
memory ts effected. 


The control logic generates the bus control line sequencing re- 
quired by the 16-Bit Computer. The memory read and write timing 
is shown in Figures E1 and E2. The bus controller function is 
simulated for the purposes of the prototype. Bus Request is 
clocked into a flip-flop and Bus Acknowledge is returned to the 


computer. The Memory Request signal from the computer ini- 
tiates amemory cycle. Fifty nanoseconds later the memory board 
responds with Address Accept. The computer then follows this 
with Data Request. The memory board responds with Data Sync 
and 50 nanoseconds later the data read out of the memory is 
clocked into the output registers and output on the data bus. 
Looking at the memory read timing diagram, it is seen that a read 
cycle is initiated with Memory Request but the data is not sent 
back to the computer until the beginning of the next microcycle. 


The write cycle is extended one oscillator cycle. This is necessary 
with the Am9124 RAMs because the data are not sent to the 
memory board until Data Request goes active (see Figure E2), 
which is 100 nanoseconds into the write cycle. With the clocked 
handshaked memory protocol of the 16-Bit Computer, this is 
easily done by delaying Data Sync one oscillator cycle. Since 
normally a computer performs many more read than writes, this 
impacts throughput only slightly. 


Additional logic was appended to allow the memory to be ac- 
cessed by the System 29 microprogramming development sys- 
tem. The Map Page (MAPP) of System 29 was used to specify the 
memory. The logic interfaces the control signals required by 
System 29 and the 16-Bit Computer Memory board. With this 
logic, the System 29 user can readily read or write into the 
memory. 
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Figure E1. Memory Read Timing. 
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Figure E2. Memory Write Timing. 
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AC performance, worst case, 106 
AC propagation delay paths, longest, 33 
Access time of memory, read, 5 
Accumulator configuration , ALU, 96-99 
Accumulators, 3 
Add, BCD, 157, 159 
microcode, 166-167 
ADD DIRECT microword, 123-125 
ADD IMMEDIATE microword, 123, 124 
ADD instruction: 
description of, 349 
steps for, 98 
ADD microword, 123, 124 
ADD RRi1 microword, 123, 125 
ADD WITH CARRY instruction description, 349 
Adder: 
carry lookahead, four-bit, 94-95 
full (see Full adder) 
Address, effective, 7 
Address-based microinstruction cycle, 35 
Address compare mode: 
Am2940, 242 
Am2942, 251 
Address counter: 
Am2940, 241 
Am2942, 250 
Address-data-based microinstruction cycle, 35, 37 
Address-instruction-based microinstruction cycle, 
35, 36 
Address-instruction-data-based microinstruction 
cycle, 35-37 
Address line control, 240 
Address maintenance, 240 
Address multiplexer, 15 
Address output buffers, Am2940, 242 
Address register: 
Am2940, 242 
Am2942, 250 


INDEX 


Addressing, 7 
direct, 7, 265 
immediate, 7, 265 
indexed, 264, 281 
relative, 4—5 
stack, 265 
Addressing modes, HEX-29, 264-265 
ALU (see Arithmetic & Logic Unit) 
Am25LS168 decade counter, 47 
Am25LS2521, 306 
Am25LS377 eight-bit registers, 39 
Am26S02, 306 
Am27S29 512 x 8 fuse-link PROM, 288-290 
Am2900 family, 7, 61 
Am2900 Learning and Evaluation Kit, 260 
Am2900 system, Am2914 in, 215-220, 229-231 
Am2901A arithmetic logic unit/function generator, 
93, 98-101 
Am2903 compared with, 106, 113 
architecture, 99-101 
arithmetic operation speed computations, 
108 
logic operation with shift speed computations, 
109 
logic operation speed computations, 108 
magnitude only arithmetic operation with shift 
down speed computations, 110 
microinstruction control, 101 
as program control unit, 203, 204 
propagation delays, combinational, 126 
set-up and hold times, 126 
switching characteristics of, 125, 126 
timing analysis summary, 144 
two’s complement arithmetic operation with 
shift down speed computations, 109 
Am2901B ALU/register sets, 290-298 
input bus, 298, 300, 301 
output bus, 298, 299 
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-Am2902 carry lookahead generator, 214-215 
logic diagram, 216 
Am2902A carry lookahead generator, 93 —96, 201 
Am2903 four-bit expandable bipolar microproces- 
sor slice, 101 
ALU destination control, 180 
Am2901A compared with, 106, 113 
architecture, 101-104 
ALU, 102 
ALU destination control summary, 104 
ALU functions, 103, 180 
ALU shifter, 102-103 
instruction decoder, 104 
output buffers, 104 
Q Register, 103-104 
special functions, 103, 180 
two-port RAM, 102 
arithmetic operation with 16-bit speed computa- 
tions, 111 
block diagram, 102 
data bus cascading, 106 
example, 220, 221 
general description, 101 
- logic operation with shift speed computations, 
111 
logic operation speed computations, 110 
magnitude only arithmetic operation with shift 
down speed computations, 112 
microprogram control bits, 115, 122 
microroutines with, 146-159 
microword ADD, 123, 124 
microword ADD DIRECT, 123-125 
microword ADD IMMEDIATE, 123, 124 
microword ADD RR1, 123, 125 
microword FETCH, 122-124 
microword INIT, 122, 123 
mnemonics, 180 
programming, 104 
propagation delays, combinational, 127 
RAM address cascading, 106 
registers, expanding the number of, 105-106 
sample microroutines, 122-125 
set-up and hold times, 127 
16-bit design, 113-125 
switching characteristics of, 125, 127 
timing analysis summary, 144 
two’s complement arithmetic operation with shift 
down and 16-bit speed computations, 112 
Am2904 status and shift control unit, 131 
AMDASM Phase 1 and Phase 2 listings of micro- 
programs, 168-179 
arithmetic operation and 16-bit speed computa- 
tions, 141 
arithmetic operation speed computations, 138 
arithmetic operation two's complement with shift 
down and 16-bit speed computations, 142 
assumed set-up time, 135 
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BCD add, 157, 159, 166-167 

BCD hardware additions, 154-155 

BCD to binary conversion, 155-156, 164-165 

Binary to BCD conversion, 156-158, 164-165 

bit operations, 132 

block diagram, 131 

carry-in control multiplexer instruction codes, 134 

carry-in multiplexer, 132 

condition code multiplexer, 132 

condition code output instruction codes, 134 

CPU hardware diagram, 182-187 

criteria for comparing two numbers following “A 
minus B”’ operations, 134 

double-length normalize command, 146-149, 
162-163 

load operations, 132 

logic operation with shift speed computations, 
139, 141 

logic operation speed computations, 138, 140 

machine register condition code output, 181 

machine status register, 131 

machine status register instruction codes, 132, 
181 

magnitude only arithmetic operation with shift 
down speed computations, 140, 142 

microprogram structure, 144-145 

microregister condition code output, 181 

microstatus register, 131 

microstatus register instruction codes, 132, 181 

mnemonics, 181 

non-restoring binary roots, 153-154, 162-163 

normalization microroutine, 146-147 

pipelined microprogram bits, 144-146 

preliminary switching characteristics, 135 

register operations, 132 

sample microroutines, 146-159 

shift linkage multiplexer instruction codes, 133 

shift linkage multiplexers, 131-132 

16-bit, 135, 144 

standard device Schottky speeds, 135 

status registers, 131 

timing analysis, 135 

two’s complement arithmetic operation with shift 
down speed computations, 139 

two’s complement division, 150-153, 160-161 

two’s complement multiplication, 150, 160-161 

unsigned multiply, 147, 149, 160-161 

Y output instruction codes, 135 


Am2909 microprogram control sequencer, 33, 38 


computer control unit using, 42-45 


Am2910 microprogram sequencer, 9, 17, 229, 231, 


232, 330 
architecture, 17-18 
block diagram, 231 
calculations on, 24-27 
complete CCU using, 43, 48-49 


Am2910 microprogram sequencer (Cont.): 
computer control unit architecture using, 22-38 
initializing, 38 
instruction set, 19-22 

table, 18 

Am2911 microprogram sequencer, 15-17, 220, 222 
in computer control unit, 39-42 
propagation delay calculations for, 24, 28-31 

Am2914 Vectored Priority Interrupt controller, 

211-235 
in Am2900 system, 215-220, 229-231 
block diagram, 212-214 
cascading, 214-215 
logic symbol, 212 
microinstruction set, 212-213 
programming, 212-213 
registers, 213-214 
time delay using, 220, 224 —229 

Am2918 four-bit register, 10, 42, 47 

Am2922 eight-input multiplexer, 24-32, 47 

Am2930 program control unit, 201-204 
block diagram, 201 
parallel look-ahead expansion scheme for, 202 
ripple expansion scheme for, 202 

Am2940 DMA address generator, 241-249 
architecture, 241-247 
control modes, 242-243 
example design, 247-249 
general description, 241 
instructions, 243 
timing, 243-247 

Am2942 Programmable Timer/Counter, DMA Ad- 

dress Generator, 249-256 
architecture, 249-251 
block diagram, 250 
control modes, 25% 
example designs, 252, 254-256 
function table, 253 
general description, 249 
instructions, 251-253 

Am29705 16-word by 4-bit two-port RAM, 105-106 
block diagram, 105 

Am29761 265-word 4-bit PROM, 39 

Am29803A 16-way branch control unit, 43 
computer control unit using, 42-45 
function table, 46 

Am29811A next address control unit, 15-17 

_ in computer control unit, 39-42 
instruction set, 16 
instruction set difference from Am2910 instruc- 

tion set, 22 
propagation delay calculations for, 24, 28-31 

Am74S$138, 306 

Am74S158 Two-input Multiplexer, 42 

Am74S175, 42 

Am74S181 four-bit arithmetic logic unit/ function 

generator, 96, 97 
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Am74S8240, 306 
Am74S373, 306 
Am74S374 registers, 290, 291, 301, 305 
Am9080A type microprocessor, 10, 239 
CRT controller and, 62 
AMD 16-Bit Computer design (see Super Sixteen) 
AMD System/29 (see System/29) 
AMDASM, 10 
definition file, 330, 336-337 
and assembly file, 79-87 
Phase 1 and Phase 2 listings of Am2904 micropro- 
grams, 168-179 
AND BYTE instruction description, 351 
AND instruction description, 349 
Applications, microprogram state machine, 
33-34 
Architecture: 
Am2901A, 99-101 
Am2903 (see Am2903, architecture) 
Am2940, 241-247 
Am2942, 249-251 
computer, 2-10 
microcomputer, block diagram of, 10 
microprogrammed, 10 
possible state machines, 34-37 
figure, 35-37 
subroutine stack, 200 
Super Sixteen CPU, 321-330 
logic diagrams, 365-384 
Arithmetic & Logic Unit (ALU), 3-6, 93 
Am2908, 102 
destination control, Am2903, 180 
destination control summary, Am2903, 104 
functions, Am2908, 180 
multi-register, 98 
shifter, 102-103 
Super Sixteen, 321-323, 366-367 
Arithmetic & Logic Unit/accumulator configuration, 
96-99 
Arithmetic & Logic Unit/register sets, Am2901B, 
290-298 
Arithmetic & Logic Unit/three register machine, op- 
eration of, 98-99 
Arithmetic operation with 16-bit speed computa- 
tions: 
Am2908, 111 
Am2904, 141 
Arithmetic operation speed computations: 
Am2901A, 108 
Am2904, 138 


Arithmetic operation two’s complement with shift 


down and 16-bit speed computations, Am2904, 
142 
Arithmetic operations, 3 
HEX-29, 266-267 
Arithmetic Processor Unit, 6 
Asynchronous events, 207 


390 


BCD add, 157, 159 
microcode, 166-167 
BCD conversion, binary to, 156-158 
microcode, 164-165 
BCD hardware additions, Am2904, 154-155 
BCD to binary conversion, 155-156 
microcode, 164-165 
Binary conversion, BCD to, 155-156 
microcode, 164-165 
Binary roots, non-restoring, 153-154 
microcode, 162-163 
Binary to BCD conversion, Am2904, 156-158 
microcode, 164-165 
Bipolar microprogram controller, 17 
Bit operations, Am2904, 132 
Bit slice timing, 106-113 
Block diagram: 
Am2903, 102 
Am2904, 131 
Am2910, 231 
Am2914, 212-214 
Am2930, 201 
Am2942, 250 
Am29705, 105 
DMA peripheral controller, 248 
HEX-29, 278, 280, 282-285 
microcomputer architecture, 10 
Branch and link flow chart, 339, 341 
BRANCH AND LINK instruction description, 355 
Branch and stack instruction, 200 
Branch code, HEX-29 microprogram sequencer, 288 
BRANCH instruction, 14 
BRANCH instruction description, 355 
BRANCH ON CONDITION instruction description, 
355 
BRANCH ON INDEX instruction description, 355 
BRANCH operation, 4 
Branches, conditional, 261-262 
Bulk Memory, 6 
BYTE SWAP instruction description, 351 
Bytes, 8-bit, 191 


Call executive instruction, 275 

CALL instruction description, 352 

Call interrupt service routine microprogram, 220, 
223 

Carry flag, 263 

Carry generate, 94 

Carry-in, 94 

Carry-in control multiplexer instruction codes, 134 

Carry-in multiplexer, Am2904, 132 

Carry lookahead adder, four-bit, 94-95 

Carry lookahead generator, Am2902A, 93-96, 201 

Carry method usages, ripple, 95 

Carry output flag, 99 


Carry propagate, 94 
Cascading the Am2914, 214-215 
CCU (see Computer Control Unit) 
Central processing unit architecture, Super Sixteen, 
321-330 
logic diagrams, 365-384 
Central Processor Unit (CPU), 3-7 
hardware diagram, Am2904, 182-187 
HEX-29 (see HEX-29 CPU) 
with internal high speed registers, 3, 4 
read timing, Super Sixteen, 325 
Centralized DAM, 240 
Channels, 275-276 
Clock, system: 
Am2940, 242 
Am2942, 251 
8-phase, 8 
HEX-29, 278, 281, 286 
Clock control, Super Sixteen, 322, 324, 374-375 
Clock pulse (CP), 262 
numbered, 66 
COMPARE instruction description, 350 
COMPARE LOGICAL BYTE instruction description, 
351 
COMPARE LOGICAL instruction description, 350 
COMPARE LONG instruction description, 353 
Computer, stored-program, 3-7 
Computer architecture, 2-10 
(See also Architecture) 
Computer basics, 3-5 
Computer control flow diagram, 34 
Computer control function flow diagram, 9 
Computer Control Unit (CCU), 5-10 
architecture, 15-17 
using Am2909 and Am29803A, 42-45 
using Am2910, 22-38, 43, 48-49 
using Am2911 and Am29811A, 39 
using Super Sixteen, 322 
set-up for high-speed micro-level interrupt han- 
dling, 232 
timing, 23-33 
Computer data path, three register, 97 
Computer design, AMD 16-bit (see Super Sixteen) 
Condition code input, 202 
Condition code multiplexer, 15 
Am2904, 132 
Condition code output instruction codes, Am2904, 
134 
Condition code register, HEX-29, 263-264 
Condition select multiplexer, 261 
Conditional branch flow chart, 339, 340 
Conditional branches, 261-262 
CONDITIONAL JUMP PIPELINE instruction, 19, 
21-22 
figure, 20 
CONDITIONAL JUMP REGISTER/COUNTER or 
PIPELINE instruction, 21 


CONDITIONAL JUMP REGISTER/COUNTER or 
PIPELINE instruction (Cont.): 
figure, 20 
Conditional jump speed computations, 24, 25, 28, 29 
CONDITIONAL JUMP-TO-SUBROUTINE instruc- 
tion, 19, 21 
figure, 20 
CONDITIONAL JUMP VECTOR instruction, 21 
figure, 20 
Conditional jumping, 13-15 
Conditional operation, 4 
Conditional push speed computations, 32 
CONTINUE instruction, 22 
figure, 20 
CONTINUE (CONT) statement, 13 
Control modes, Am2940, 242-243 
Control register: 
Am2940, 241 
Am2942, 250 
Control store, 260 
Counter register, microprogram, 15 
CP (clock pulse), 66, 262 
CPU (see Central Processor Unit) 
CRT controller: 
AMDASM definition and assembly files, 79-87 
complete wiring diagram for, 50, 52-53 
design of, 47, 50-61 
display formats accommodated on, 78-87 
logic diagram, 47, 51 
of interface circuit for, 62, 63 
microprogram for, 50, 54-56 
principle of operation, 47, 50 
software emulation, 66-67 
of System 29 universal card, 88 
timing considerations, 50, 57-61 
wiring diagrams, 62, 64-65 
Cycle steal method, 240 


D bus, 97 
D input, 101 
Data-Based microinstruction cycle, 35, 36 
Data bus cascading, Am2903, 106 
Data formats, 191 
Data movement, HEX-29 mnemonics, 267 
Data movement capabilities, 7 
Data multiplexer: 
Am2940, 242 
Am2942, 251 
Data path, 92-187 
Super Sixteen, 324, 370-371 
three register computer, 97 
Data routing, 261-262 
Data transfer control, 240 
Dead page, 274 
DECIMAL ADD instruction description, 353 
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DECIMAL SUBTRACT instruction description, 353 
DECREMENT INDEXES instruction description, 
353 
Defined register class of instructions, 265 
Definition file, AMDASM, 330, 336-337 
Delay, time, using Am2914, 220, 224-229 
Delay path, longest signal, 23-24 
Delays, 6 
Depth, memory, 5 
Depth-over-width (d/w) ratio of memory, 10 
Design, microprogrammed, 12-61 
Direct addressing, 7, 265 
Direct memory access (DMA), 6, 238-256 
centralized, 240 
control, HEX-29, 301, 305 
distributed, 240 
implementation, 240 
I/O system, 240 
peripheral controller block diagram, 248 
repetitive, 240 
Direct memory access (DMA) Address Generator, 
Am2940, 241-249 
Direct memory access (DMA) Controller, 240 
Disk drive management, System 29 operating sys- 
tem, 361 
Distributed DMA, 240 
Dividend in divide operations, 151. 
Division, two’s complement, 150-153 
microcode, 160-161 
Divisor in divide operations, 151 
DMA (see Direct memory access) 
Double-length normalize command, 146-149 
microcode, 162-163 
Double words, 32-bit, 191 
d/w (depth-over-width) ratio of memory, 10 


Effective address, 7 

Emulation, software, of CRT controller, 67-77 

Enable control, extended, 42 

Enable stack signal (FILE ENABLE), 15 

Engineering model Super Sixteen, 346-348 

EXCHANGE BYTE instruction description, 351 

EXCHANGE PROGRAM STATUS instruction de- 
scription, 351 

EXECUTE instruction description, 353 

Execution of microinstructions, 13 

Executive interrupts, 207 

Exponent, signed, 191 

Extended enable control, 42 


F bus, 97 
FETCH, overlapping or pipelining, 14-15 
FETCH instruction, 4 
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FETCH microword, 122-124 
Fetch routine, 220, 223, 231 
Fields, microinstruction, 13 
FILE ENABLE (enable stack signal), 15 
Fixed point numbers, 191 
Floating point numbers, 191 
Flow diagram: 
computer control, 34 
computer control function, 9 
read control, 249 
FORMATTING, 13 
Fraction, signed, 191 
Full adder: 
basic, understanding, 93-99 
four-bit ripple-carry, 93 
truth table, 93 
Full adder cells, cascaded, 93 
Full stack, 19 
Function logic, 262 


General purpose (GP) computer, 8 
General register class of instructions, 265 
Generate, carry, 94 

GP (general purpose) computer, 8 


HAL (HEX-29 Assembly Language), 264 
Half sign flag, 263 
HEX-29 Assembly Language (HAL), 264 
HEX-29 CPU, 258-315 
addressing modes, 264-265 
arithmetic operations, 266-267 
block diagram, 278, 280, 282-285 
carry-in control, 290 
condition code control, 290, 298 
condition code register, 263-264 
DMA control, 301, 305 
DMA/refresh control, 275-276 
features, 259-260 
general specifications, 263-275 
instruction matrix, 273-274 
instruction set, 265-274 
internal CPU registers, 263 
interrupt control, 301-304 
interrupt.structure, 275 
macro instructions, 268-272 
microcode, 307-315 
microprogram control, 281, 287-288 
microprogram sequence branch code, 288 
microword memory, 288-290 
operating system for timesharing (HOST), 275 
shift and rotate linkage, 290 
system bus, 276-277 
system clock, 278, 281, 286 
system design goals, 259 


HEX-29 CPU (Cont.): 
system timing, 277-281 
HEX-64KBS static memory card, 306 


ICU (interrupt control units), 215, 219, 220 
immediate addressing, 7, 265 
Immediate instruction flow chart, 337, 339 
Indexed addressing, 264, 281 
indirect addressing, 7 
INIT microword, 122, 123 
Initializing the Am2910, 38 
Input, vector, 42-43 
Input bus, Am2901B, 298, 300, 301 
INPUT BYTE instruction description, 354 
Input instruction flow chart, 340, 341 
Inp ut/output (see I/O) 
INPUT WORD instruction description, 354 
INSERT CHARACTER instruction description, 350 
Instruction-Based microinstruction cycle, 35 
Instruction control speed computations, 27, 30 
Instruction-data-based microinstruction cycle, 35, 
36 
Instruction Decoder, 9 
Am2903, 104 
Am2940, 242 
Am2942, 251 
Instruction descriptions, Super Sixteen, 349-355 
Instruction Enable pin, 202 
Instruction formats, 191 
HEX-29, 264 
Super Sixteen, 321 
Instruction matrix, HEX-29, 273-274 
Instruction register, 7 
Instruction set, 7 
Am29811A, 16 
HEX-29, 265-274 
Instruction types, 191-197 
memory immediate instruction, 197 
memory to memory indexed instruction, 196 
memory to memory instruction, 194 
register immediate instruction, 196 
register to indexed memory instruction, 195 
register to memory immediate instruction, 195- 
196 
register-to-memory-reference instruction, 193- 
194 
register-to-register (RR) instructions, 191-193 
register with short-immediate instruction, 194— 
195 
Instructions, 3 
Am2940, 243 
Am2942, 251-253 
defined register class of, 265 
executed sequentially, 198 
executing, 4-5 


Instructions (Cont.): 
general register class of, 265 
Super Sixteen, 319-321 
(See also Microinstructions) 
Interface circuit for CRT controller, 62, 63 
Intermediate slice (IS), 102 
Internal high speed registers, CPU with, 3, 4 
Interprocessor interrupts, 207 
Interrupt, 206-235 
Super Sixteen, 324, 376-377 
Interrupt acknowledge, 208 
Interrupt control, HEX-29, 301-304 
Interrupt control units (ICU), 215, 219, 220 
Interrupt Controller, 6 
Interrupt driven I/O, 239 
Interrupt example, microprogram level, 229-235 
Interrupt handling: 
computer control unit set-up for high-speed 
micro-level, 232 
sequence of events for, 207-208 
Interrupt masking, 207-208 
Interrupt microprogram, return, 220, 223, 231 
Interrupt nesting, 210 
Interrupt priority encoder, 210-211 
Interrupt recognition, 207 
interrupt registers, HEX-29, 264 
Interrupt request: 
instruction flow during, 220, 223 
multiple, 209 
daisy chain acknowledge, 209 
single: daisy chain acknowledge, 208 
multiple poll, 208 
Interrupt request clearing, 210 
Interrupt request handling, multiple, 210 
Interrupt request masking, dynamic, 210 
Interrupt request prioritization, 210 
Interrupt request priority threshold, 211 
Interrupt Return instruction, 208 
Interrupt sequence timing, 234, 235 
Interrupt service routine, 208 
Interrupt service routine microprogram, call, 220, 
223 
Interrupt service routine nesting, 211 
Interrupt structure, 208-209 
general purpose, 210-211 
HEX-29, 275 
Interrupts: 
machine versus microprogram level, 207 
microprogram, 43 
priority schemes in, 209-210 
types of, 207 
Intraprocessor interrupts, 207 
Intrasystem interrupts, 207 
Invalid access block, 274 
Invalid instruction trap, 275 
I/O (input/output), 239 
devices, 3,6 
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/O (input/output) (Cont.): 
DMA, 240 
Super Sixteen, 324, 376-377 
write timing, 328 


1S (intermediate slice), 102 


JMP (JUMP) instruction, 13 
JUMP, UNCONDITIONAL, 14-15 
JUMP and ZERO (JZ) instruction, 19 
figure, 20 
JUMP (JMP) instruction, 13 
JUMP MAP instruction, 19 
figure, 20 
Jump map speed computations, 26, 29, 30 
JUMP operation, 4 
JUMP-TO-ONE-OF-TWO-BRANCH-ADDRESSES in- 
struction, 17 
JUMP-TO-ONE-OF-TWO-SUBROUTINES instruc- 
tion, 15, 17 
JUMP-TO-SUBROUTINE instruction, 15, 200 
Jumping: 
conditional, 13-15 
microprogram, 13 
JZ (JUMP and ZERO) instruction, 19, 20 


Last-in first-out (LIFO) stacking arrangement, 199 
Latch bypass, 220 
Latency times, 6 
Least significant slice (LSS), 102 
LIFO (last-in first-out) stacking arrangement, 199 
LOAD BYTE instruction description, 350 
LOAD COUNTER AND CONTINUE instruction, 22 
figure, 20 
LOAD instruction description, 349 
Load operations, Am2904, 132 
LOAD PROGRAM STATUS WORD instruction de- 
scription, 351 
Load select control function, 14 
LOAD STACK instructions, description of, 352 
Logic diagram: 
Am2902, 216 
Super Sixteen CPU, 365-384 
Logic operation with shift speed computations: 
Am2901A, 109 
Am2903, 111 
Am2904, 139, 141 
Logic operation speed computations: 
Am2901A, 108 
Am2903, 110 
Am2904, 138, 140 
Logic symbol, Am2914, 212 
Logical address, 273-274 
Logical data, 191 
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Logical operations, 3 

HEX-29 mnemonics, 267 
Lookahead adder, four-bit carry, 94-95 
LSS (least significant slice), 102 


Machine interrupts versus microprogram level inter- 
rupts, 207 
Machine level instructions, microprogram instruc- 
tions versus, 9-10 
Machine register condition code output, Am2904, 
181 
Machine status register, Am2904, 131 
Machine status register instruction codes, Am2904, 
132, 181 
Machines, microprogrammed, 260-262 
versus non-microprogrammed, 8-9 
Macro assembler disable opcode patch, 347 
Macro instructions, HEX-29, 268-272 
(See also Instructions) 
Macro library, 347, 356-357 
Mag nitude only arithmetic operation with shift down 
speed computations: 
Am2903, 110, 112 
Am2904, 140, 142 
Mapping PROM, 15, 17 
MAR (Memory Address Register), 3-6 
Mask bus, 220 
MDR (Memory Data Register), 3-6 
Memory: 
Bulk, 6 
depth-over-width ratio of, 10 
microprogram, 8, 13 
program to write into, 88-89 
random access (see Random Access Memory) 
read access time of, 5 
Memory access: 
direct (see Direct memory access) 
random (see Random Access Memory) 
Memory Address Register (MAR), 3-6 
Memory addressing scheme: 
with PC in ALU, 193 
with PC outside ALU, 197 
Memory board, Super Sixteen, 362, 378 —-381, 384 
Memory control, Super Sixteen, 322, 324, 374-375 
Memory Data Register (MDR), 3-6 
Memory depth, 5 
Memory immediate instruction, 197 
Memory management, HEX-29, 273-274 
Memory management registers, HEX-29, 264 
Memory mapped I/O, 239 
Memory mapping program address, 273 -274 
Memory read timing, Super Sixteen, 363 
Memory to memory indexed instruction, 196 
Memory to memory instruction, 194 
Memory width, 5 


Memory write timing, Super Sixteen, 364 
Microcode: 
branch and stack instruction, 200 
memory immediate instruction, 197 
memory to memory indexed instruction, 196 
memory to memory instruction, 194 
register immediate instruction, 196 
register short-immediate instruction, 195 
register to indexed memory instruction, 195 
register to memory immediate instruction, 195 
register to memory immediate instruction im- 
proved, 198 
register-to-memory-reference instruction, 194 
register-to-register instruction, 193 
register-to-register instruction with overlap of 
execute and PC control, 198 
return-from-subroutine instruction, 200 
Super Sixteen, 337-345 
Microcode translation, Super Sixteen, 345, 346 
Microcomputer: 
HEX-29 (see HEX-29 CPU) 
16-bit (see Super Sixteen) 
Microinstruction control, Am2901A, 101 
Microinstruction cycle, 35-37 
Microinstruction fields, 13 
Microinstruction format, Super Sixteen, 330-337 
Microinstruction set, Am2914, 212-213 
Microinstructions, 13 
execution of, 13 
(See a/so Instructions) 
Micromachine, 7, 8-10 
Microprogram: 
Am2904 AMDASM Phase 1 and Phase 2 listing of, 
168-179 
AMDASM definition and assembly files, 79-87 
CRT controller, 50, 54-56 
Super Sixteen, 340, 342-345 
Microprogram control, 260-261 
HEX-29, 281, 287-288 
Microprogram control bits, Am2903, 115, 122 
Microprogram controller, bipolar, 17 
Microprogram counter (mPC), 18 
Microprogram counter register, 15 
Microprogram execution, timing diagram of, 37 
Microprogram instructions, machine level instruc- 
tions versus, 9-10 
Microprogram interrupt, 43 
Microprogram jumping, 13 
Microprogram level interrupt: 
example of, 229-235 
versus machine interrupts, 207 
Microprogram memory, 8, 13 
Microprogram sequencer, 260-261 
Microprogram start-up flow chart, 337 
Microprogram state machine applications, 33-34 
Microprogram structure, Am2904, 144-145 
Microprogrammed architecture, 10 


Microprogrammed design, 12-61 
key features of, 13 
Microprogrammed machines, 260-262 
Microprogramming, 259 
Microprogramming control, subroutining in, 15 
Microregister condition code output, Am2904, 181 
Microroutines, sample: 
Am2908, 122-125 
Am2904, 146-159 
Microstatus register, Am2904, 131 
Microstatus register instruction codes, Am2904, 
132, 181 
Microword ADD, 123, 124 
Microword ADD DIRECT, 123-125 
Microword ADD IMMEDIATE, 123, 124 
Microword ADD RRA1, 123, 125 
Microword FETCH, 122-124 
Microword INIT, 122, 123 
Microword memory, HEX-29, 288-290 
Microword register, 14 
Mode control, 240 
Monitor listing, Super Sixteen, 347, 358-360 
Most significant slice (MSS), 102 
MOVE LONG instruction description, 353 
mPC (microprogram counter), 18 
MSS (most significant slice), 102 
Multiplexer (MUX): 
address, 15 
condition code, 15 
data, 242, 251 
three-input, 99 
Multiplication, two’s complement, 150 
microcode, 160-161 
MULTIPLY instruction description, 350 
Multiply unsigned instruction, 147, 149 
microcode, 160-161 
MULTIPLY UNSIGNED instruction description, 350 
Multiprocessor, interrupts in, 207 
MUX (see Multiplexer) 


N output, 102 
Negative numbers, 191 
Negative single-length number, normalized and un- 
normalized, 147 
Nested subroutine example, 199 
Nesting: 
of interrupt service routines, 211 
of interrupts, 210 
Non-polling versus polling systems, 207 
Non-restoring binary roots, 153-154 
microcode, 162-163 
Normalization microroutine, 146-147 
Normalized negative single-length number, 147 
Normalized positive number, 146 
Numerical value of zero, 191 
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OP CODE (operation code), 7,9 
Operands, 3 
for operations, 7 
Operating system, System 29, disk drive manage- 
ment, 361 
Operation code (OP CODE), 7,9 
Operations, operands for, 7 
OR BYTE instruction description, 351 
OR instruction description, 349 
Output buffers, Am2903, 104 
Output bus, Am2901B, 298, 299 
OUTPUT BYTE instruction description, 354 
Output flags, 99 
Output instruction flow chart, 340, 341 
OUTPUT WORD instruction description, 354 
Overflow, 95 
Overflow detect output flag, 99 
Overflow detection signal (OVR), 93, 101, 102 
Overlap of execute and PC control, 198 
Overlapping: 
FETCH, 14-15 
Super Sixteen, 322 
OVR (overflow detection signal), 93, 101, 102 


Parallel cascade mode, 214-217 
Parallel look-ahead expansion scheme for Am2930, 
202 
PCU (see Program Control Unit) 
Phase 1 and phase 2 periods, 277-278 
Physical page zero, 273 
Pin functions, 17 
Pipeline registers, 14-15, 260-261, 290, 291 
Pipelined microprogram bits, Am2904, 144-146 
Pipelined operations, Super Sixteen, 329-330 
Pipelining: 
FETCH, 14-15 
Super Sixteen, 322 
Polling, 6 
versus non-polling systems, 207 
POP instruction description, 352 
POP operation, 18 
Positive numbers, 191 
normalized and unnormalized, 146 
P/POP instruction description, 352 
P/PUSH instruction description, 352 
Priority schemes in interrupt, 209-210 
Program, 3 
to write into character memory, 88-89 
Program control, HEX-29 mnemonics, 267 
Program Control Unit (PCU), 4-6, 190-204 
Am2901A as, 203, 204 
counter-type, 4 
Super Sixteen, 321, 368-369 
Program control unit performance, improving, 
197-203 


396 


Program steps, 4 
Programmed 1/O, 239 
Programming, 7 
Am2914, 212-213 
PROM, 8 
mapping, 15, 17 
Propagate, carry, 94 
Propagation delay calculations: 
on Am2910 microprogram sequencer, 24-27 
for Am2911 and Am29811A design, 24, 28-31 
Propagation delays, combinational: 
Am2901A, 126 
Am2903, 127 
PUP (push/pop control), 15 
PUSH/CONDITIONAL LOAD COUNTER instruction, 
19-21 
figure, 20 
PUSH instruction description, 352 
PUSH operation, 18 
Push/pop control (PUP), 15 


Q input, 101 
Q register, 99 
Am2903, 103-104 


R input field, 99 
Random Access Memory (RAM), 8, 99 
address cascading, Am2903, 106 
shift network, 99 
two-port, 102 
write enable (RAM EN), 99 
Read access time of memory, 5 
Read control flow chart, 249 
Register immediate instruction, 196 
Register operations, Am2904, 132 
Register with short-immediate instruction, 194-195 
Register to indexed memory (RX) instructions, 195 
addressing, 7 
flow chart of, 337, 338 
sequence of, Super Sixteen 328 
Register to memory immediate instruction, 195-196 
Register-to-memory-reference instruction, 193-194 
Register-to-register (RR) instructions, 191-193 
addressing, 7 
flow chart of, 337, 338 
sequence of, Super Sixteen, 328 
Registers: 
Am2914, 213-214 
pipeline, 14-15, 260-261, 290, 291 
Working, 3 
Relative addressing, 4-5 
Reliability, system, 277 
Remainder, true value of, 151 


REPEAT LOOP, COUNTER = ZERO instruction, 21 
figure, 20 
REPEAT PIPELINE REGISTER, COUNTER ZERO, 
21 
figure, 20 
Repetitive DMA, 240 
RESET instruction, 18, 19 
figure, 20 
Restore after interrupt service routine, 208 
Return-from-interrupt sequence timing, 234, 235 
Return-from-subroutine command, 198 
RETURN-FROM-SUBROUTINE instruction, 15, 21 
figure, 20 
Return-from-subroutine instruction microcode, 200 
RETURN instruction description, 352 
Return interrupt microprogram, 220, 223, 231 
Return register, 198 
Ripple carry method usages, 95 
Ripple cascade mode, 214-217 
Ripple expansion scheme for Am2930, 202 
Ripple propagation time, 93 
ROTATE LEFT instruction description, 354 
ROTATE RIGHT instructions, description of, 354 
Rotating structure interrupt scheme, 209-210 
RR instructions [see Register-to-register (RR) in- 
structions] 
RX instructions [see Register to indexed memory 
(RX) instructions] 


S (sum output), 93 
S input field, 99 
S/29 (see System/29) 
Save status, 207 
Schottky speeds, standard device, 107 
Am2904, 135 | 
SET, CLR, COMPLEMENT, TEST BIT PSW instruc- 
tion description, 352 
Set-up and hold times: 
Am2901A, 126 
Am2903, 127 
Shift and rotate instruction flow chart, 339, 
341 
Shift-down operations, 115 
SHIFT LEFT instructions, description of, 354 
Shift linkage multiplexer instruction codes, 
Am2904, 133 
Shift linkage multiplexers, Am2904, 131-132 
Shift network, RAM, 99 
SHIFT RIGHT instructions, description of, 353 
Shift-up operations, 115 
Shifter, 3, 4 
Sign bit output flag, 99 
Signal delay path, longest, 23-24 
Single-length normalize command, 146-148 
microcode, 162-163 


Slice: 
intermediate (1S), 102 
least significant (LSS), 102 
most significant (MSS), 102 
Software, 7 
Software emulation of CRT controller, 66-77 
SP (stack pointer), 4, 15, 18 
SSI/MSI, 135, 143 
Stack, full, 19 
Stack addressing, 265 
Stack and link, 198 
Stack pointer (SP), 4, 15 
built-in, 18 
Standard device Schottky speeds, 107 
Am2904, 135 
Static structure interrupt scheme, 209 
Status bus, 220 
Status registers, Am2904, 131 
STORE BYTE instruction description, 350 
STORE CHARACTER instruction description, 350 
STORE instruction description, 349 
STORE PROGRAM STATUS WORD instruction de- 
scription, 351 
STORE STACK instructions, description of, 352 
Stored-program computer, 3-7 
Subroutine example, nested, 199 
Subroutine stack architecture, 200 
Subroutining, 198-202 
in microprogramming control, 15 
SUBTRACT instruction description, 349 
SUBTRACT WITH CARRY instruction description, 
349 
Sum output (S), 93 
Super Sixteen, 318-384 
ALU, 321-323, 366-367 
central processing unit architecture, 321-330 
logic diagrams, 365-384 
central processing unit read timing, 325 
clock and memory control, 322, 324, 374-375 
computer control unit, 322 
data path, 324, 370-371 
engineering model, 346-348 
instruction descriptions, 349-355 
instruction format, 321 
instructions, 319-321 
interrupt and I/O, 324, 376-377 
I/O write timing, 328 
macro library, 347, 356-357 
memory board, 362, 378-381 , 384 
memory read timing, 363 
memory write timing, 364 
microcode, 337 -345 
microcode translation, 345, 346 
microinstruction format, 330-337 
microprogram, 340, 342-345 
monitor listing, 347, 358-360 
pipelined operations, 329-330 
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Super Sixteen (Cont.): 
RR instruction sequence, 328 
RX instruction sequence, 328 
S/29 WCS interface, 382-384 
system organization, 319 
SUPERVISOR CALL instruction description, 
352 
Switching characteristics: 
Am2901A, 125, 126 
Am2903, 125, 127 
preliminary Am2904, 135 
Sync control logic, 232, 233 
System reliability, 277 
System/29, 10, 345, 347-348 
operating system disk drive management, 361 
universal card, 88 
WCS interface, Super Sixteen, 382-384 


TEST END-OF-LOOP instruction, 22 
figure, 20 
TEST IMMEDIATE instruction description, 350 
THREE-WAY BRANCH instruction, 22 
figure, 20 
Time delay using Am2914, 220, 224-229 
Timing: 
Am2940, 243-247 
bit slice, 106-113 
CCU, 22-33 
HEX-29 system, 277-281 
interrupt sequence, 234, 235 
return-from-interrupt sequence, 234. 235 
Timing analysis, Am2904, 135 
Timing analysis summary, 144 
Timing considerations, CRT controller, 50, 57-61 
Timing diagram of microprogram execution, 37 
Transfer complete circuitry: 
Am2940, 242 
Am2942, 251 
TRANSLATE AND TEST instruction description, 
353 
TRANSLATE instruction description, 352 
Trap, 275 
Truth table, full adder, 93 
Two-port RAM, 102 
Two’s complement arithmetic operation with shift 
down and 16-bit speed computations, Am2903, 
112 
Two’s complement arithmetic operation with shift 
down speed computations: 
Am2901A, 109 
Am2904, 139 
Two’s complement division, 150-153 
microcode, 160-161 
Two's complement multiplication, 150 
microcode, 160-161 
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Unconditional branch flow chart, 339 
UNCONDITIONAL JUMP, 14-15 
Unconditional operation, 4 
Unnormalized negative single-length number, 147 
Unnormalized positive number, 146 
Unsigned multiply, 147, 149 

microcode, 160-161 


Vector bus, 220 
Vector generator, 232, 233 
Vector input, 42-43 


Width, memory, 5 
Word count compare mode: 
Am2940, 242 
Am2942, 251 
Word count equals zero mode, Am2942, 251 
Word count maintenance, 240 
Word count register: 
Am2940, 242 
Am2942, 251 
Word counter: 
Am2940, 242 
Am2942, 251 


Word counter carry out mode: 
Am2940, 243 
Am2942, 251 
Word lengths, 191 
Words, 5 
16-bit, 191 
Working Registers, 3 
Write-protect bit (WP), 273 


XOR BYTE instruction description, 351 
XOR instruction description, 350 


Y output instruction codes, Am2904, 135 
Y outputs, 101 
three-state, 18 


Zero, numerical value of, 191 
Zero detect output flag, 99 
ZERO instruction, 43 
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